On Improving Fairness of AI Models with Synthetic Minority Oversampling Techniques

Zhou, Yan; Kantarcioglu, Murat; Clifton, Chris

Citation Details

Biased AI models result in unfair decisions. In response, a number of algorithmic solutions have been engineered to mitigate bias, among which the Synthetic Minority Oversampling Technique (SMOTE) has been studied, to an extent. Although the SMOTE technique and its variants have great potentials to help improve fairness, there is little theoretical justification for its success. In addition, formal error and fairness bounds are not clearly given. This paper attempts to address both issues. We prove and demonstrate that synthetic data generated by oversampling underrepresented groups can mitigate algorithmic bias in AI models, while keeping the predictive errors bounded. We further compare this technique to the existing state-of-the-art fair AI techniques on five datasets using a variety of fairness metrics. We show that this approach can effectively improve fairness even when there is a significant amount of label and selection bias, regardless of the baseline AI algorithm. more »

Award ID(s):: 1939728

PAR ID:: 10549626

Author(s) / Creator(s):: Zhou, Yan; Kantarcioglu, Murat; Clifton, Chris

Publisher / Repository:: Society for Industrial and Applied Mathematics

Date Published:: 2023-04-27

ISBN:: 978-1-61197-765-3

Page Range / eLocation ID:: 874 - 882

Subject(s) / Keyword(s):: AI fairness sensitive feature synthetic data SMOTE

Format(s):: Medium: X

Location:: Minneapolis-St. Paul Twin Cities, MN, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this