Adversarial Fairness Network

Jang, Taeuk; Wang, Xiaoqian; Huang, Heng

doi:10.1609/aaai.v38i20.30220

Citation Details

Adversarial Fairness Network

Fairness is becoming a rising concern in machine learning. Recent research has discovered that state-of-the-art models are amplifying social bias by making biased prediction towards some population groups (characterized by sensitive features like race or gender). Such unfair prediction among groups renders trust issues and ethical concerns in machine learning, especially for sensitive fields such as employment, criminal justice, and trust score assessment. In this paper, we introduce a new framework to improve machine learning fairness. The goal of our model is to minimize the influence of sensitive feature from the perspectives of both data input and predictive model. To achieve this goal, we reformulate the data input by eliminating the sensitive information and strengthen model fairness by minimizing the marginal contribution of the sensitive feature. We propose to learn the sensitive-irrelevant input via sampling among features and design an adversarial network to minimize the dependence between the reformulated input and the sensitive information. Empirical results validate that our model achieves comparable or better results than related state-of-the-art methods w.r.t. both fairness metrics and prediction performance. more »

Award ID(s):: 2146091

PAR ID:: 10525248

Author(s) / Creator(s):: Jang, Taeuk; Wang, Xiaoqian; Huang, Heng

Publisher / Repository:: Proceedings of the AAAI Conference on Artificial Intelligence

Date Published:: 2024-03-25

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 38

Issue:: 20

ISSN:: 2159-5399

Page Range / eLocation ID:: 22159 to 22166

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1609/aaai.v38i20.30220

More Like this