Counterfactually Fair Representation

Zuo, Zhiqun; Khalili, Mohammad_Mahdi; Zhang, Xueru

Citation Details

The use of machine learning models in high-stake applications (e.g., healthcare, lending, college admission) has raised growing concerns due to potential biases against protected social groups. Various fairness notions and methods have been proposed to mitigate such biases. In this work, we focus on Counterfactual Fairness (CF), a fairness notion that is dependent on an underlying causal graph and first proposed by Kusner et al. (2017); it requires that the outcome an individual perceives is the same in the real world as it would be in a "counterfactual" world, in which the individual belongs to another social group. Learning fair models satisfying CF can be challenging. It was shown in (Kusner et al. 2017) that a sufficient condition for satisfying CF is to not use features that are descendants of sensitive attributes in the causal graph. This implies a simple method that learns CF models only using non-descendants of sensitive attributes while eliminating all descendants. Although several subsequent works proposed methods that use all features for training CF models, there is no theoretical guarantee that they can satisfy CF. In contrast, this work proposes a new algorithm that trains models using all the available features. We theoretically and empirically show that models trained with this method can satisfy CF. more »

Award ID(s):: 2202699

PAR ID:: 10534984

Author(s) / Creator(s):: Zuo, Zhiqun; Khalili, Mohammad_Mahdi; Zhang, Xueru

Publisher / Repository:: Proceedings of the 37th International Conference on Neural Information Processing Systems

Date Published:: 2024-05-30

Format(s):: Medium: X

Location:: Proceedings of the 37th International Conference on Neural Information Processing Systems

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this