Title: Counterfactual Fairness: Unidentification, Bound and Algorithm
Fairness-aware learning studies the problem of building machine learning models that are subject to fairness requirements. Counterfactual fairness is a notion of fairness derived from Pearl's causal model, which considers a model fair if, for a particular individual or group, its prediction in the real world is the same as that in the counterfactual world where the individual(s) had belonged to a different demographic group. However, an inherent limitation of counterfactual fairness is that it cannot be uniquely quantified from observational data in certain situations, due to the unidentifiability of the counterfactual quantity. In this paper, we address this limitation by mathematically bounding the unidentifiable counterfactual quantity, and we develop a theoretically sound algorithm for constructing counterfactually fair classifiers. We evaluate our method in experiments using both synthetic and real-world datasets and compare it with existing methods. The results validate our theory and show the effectiveness of our method.
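For reference, the counterfactual fairness criterion that the abstract builds on (Kusner et al., 2017) requires that, for an individual with observed features X = x and sensitive attribute A = a, the prediction distribution be unchanged under a counterfactual intervention on A. In structural-causal-model notation:

```latex
% Counterfactual fairness (Kusner et al., 2017): for every outcome y and
% every counterfactual value a' of the sensitive attribute A,
P\big(\hat{Y}_{A \leftarrow a}(U) = y \mid X = x, A = a\big)
  = P\big(\hat{Y}_{A \leftarrow a'}(U) = y \mid X = x, A = a\big)
```

The counterfactual term on the right-hand side is the quantity that, per the abstract, is not always identifiable from observational data and therefore can only be bounded.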
Award ID(s):
1646654
PAR ID:
10126321
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Page Range / eLocation ID:
1438 to 1444
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. As machine learning (ML) algorithms are used in applications that involve humans, concerns have arisen that these algorithms may be biased against certain social groups. Counterfactual fairness (CF) is a fairness notion proposed in Kusner et al. (2017) that measures the unfairness of ML predictions; it requires that the prediction perceived by an individual in the real world have the same marginal distribution as it would in a counterfactual world in which the individual belongs to a different group. Although CF ensures fair ML predictions, it fails to consider the downstream effects of ML predictions on individuals. Since humans are strategic and often adapt their behaviors in response to the ML system, predictions that satisfy CF may not lead to a fair future outcome for the individuals. In this paper, we introduce lookahead counterfactual fairness (LCF), a fairness notion accounting for the downstream effects of ML models, which requires the individual's future status to be counterfactually fair. We theoretically identify conditions under which LCF can be satisfied and propose an algorithm based on the theorems. We also extend the concept to path-dependent fairness. Experiments on both synthetic and real data validate the proposed method.
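One plausible way to read the LCF requirement, writing h for the future status an individual reaches after reacting to the prediction (this notation is an assumption made here, not taken from the paper), is that the counterfactual-fairness equality is imposed on h rather than on the prediction itself:

```latex
% Lookahead counterfactual fairness (sketch): the future status h, not the
% prediction \hat{Y}, must be distributed identically across the factual and
% counterfactual worlds.
P\big(h_{A \leftarrow a}(U) = y \mid X = x, A = a\big)
  = P\big(h_{A \leftarrow a'}(U) = y \mid X = x, A = a\big)
```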
  2. Ensuring fairness in anomaly detection models has received much attention recently, as many anomaly detection applications involve human beings. However, existing fair anomaly detection approaches mainly focus on association-based fairness notions. In this work, we target counterfactual fairness, a prevalent causation-based fairness notion. The goal of counterfactually fair anomaly detection is to ensure that the detection outcome for an individual in the factual world is the same as that in the counterfactual world where the individual had belonged to a different group. To this end, we propose a counterfactually fair anomaly detection (CFAD) framework that consists of two phases: counterfactual data generation and fair anomaly detection. Experimental results on a synthetic dataset and two real datasets show that CFAD can effectively detect anomalies as well as ensure counterfactual fairness.
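A minimal, hypothetical sketch of the two-phase structure described above (counterfactual data generation, then fair anomaly detection). The function names and the simple distance-based detector are illustrative assumptions, not the CFAD implementation:

```python
import numpy as np

def generate_counterfactuals(X, sensitive_col):
    """Phase 1 placeholder: produce counterfactual copies of X with a binary
    sensitive attribute flipped. A real implementation would derive these
    from a causal or generative model rather than a simple column flip."""
    X_cf = X.copy()
    X_cf[:, sensitive_col] = 1 - X_cf[:, sensitive_col]
    return X_cf

def fair_anomaly_scores(X, X_cf):
    """Phase 2 placeholder: score each point by its distance to the mean of
    the pooled factual + counterfactual data, averaging factual and
    counterfactual scores so the score is invariant to flipping A."""
    pooled = np.vstack([X, X_cf])
    mu = pooled.mean(axis=0)
    score_f = np.linalg.norm(X - mu, axis=1)
    score_cf = np.linalg.norm(X_cf - mu, axis=1)
    return 0.5 * (score_f + score_cf)

# Example usage with random data; the sensitive attribute sits in column 0.
rng = np.random.default_rng(0)
X = np.hstack([rng.integers(0, 2, size=(100, 1)), rng.normal(size=(100, 4))])
scores = fair_anomaly_scores(X, generate_counterfactuals(X, sensitive_col=0))
```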
  3. Fairness-aware machine learning has attracted a surge of attention in many domains, such as online advertising, personalized recommendation, and social media analysis in web applications. Fairness-aware machine learning aims to eliminate biases of learning models against subgroups described by protected (sensitive) attributes such as race, gender, and age. Among the many existing fairness notions, counterfactual fairness is a popular notion defined from a causal perspective. It measures the fairness of a predictor by comparing the prediction for each individual in the original world with that in counterfactual worlds in which the value of the sensitive attribute is modified. A prerequisite for existing methods to achieve counterfactual fairness is prior human knowledge of the causal model for the data. However, in real-world scenarios the underlying causal model is often unknown, and acquiring such knowledge can be very difficult. In these scenarios, it is risky to directly trust causal models obtained from information sources of unknown reliability, or even from causal discovery methods, as incorrect causal models can introduce biases into the predictor and lead to unfair predictions. In this work, we address the problem of counterfactually fair prediction from observational data without a given causal model by proposing a novel framework, CLAIRE. Specifically, under certain general assumptions, CLAIRE effectively mitigates the biases from the sensitive attribute with a representation learning framework based on counterfactual data augmentation and an invariant penalty. Experiments conducted on both synthetic and real-world datasets validate the superiority of CLAIRE in both counterfactual fairness and prediction performance.
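A hypothetical sketch of the "counterfactual data augmentation + invariant penalty" idea described above, written in PyTorch. The names (encoder, predictor, x_aug) and the particular MSE penalty are illustrative assumptions, not the CLAIRE architecture:

```python
import torch
import torch.nn.functional as F

def claire_style_loss(encoder, predictor, x, x_aug, y, lam=1.0):
    """Prediction loss on the factual data plus a penalty that pushes the
    representations of factual inputs x and counterfactually augmented
    inputs x_aug (sensitive attribute modified) to coincide."""
    z, z_aug = encoder(x), encoder(x_aug)
    pred_loss = F.binary_cross_entropy_with_logits(predictor(z).squeeze(-1), y)
    invariance_penalty = F.mse_loss(z, z_aug)
    return pred_loss + lam * invariance_penalty
```

The weight lam trades off predictive accuracy against how strongly the representation is made invariant to the sensitive attribute.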
  4. A recent trend in fair machine learning is to define fairness via causality-based notions, which concern the causal connection between protected attributes and decisions. However, one common challenge for all causality-based fairness notions is identifiability, i.e., whether they can be uniquely measured from observational data, which is a critical barrier to applying these notions in real-world situations. In this paper, we develop a framework for measuring different causality-based fairness notions. We propose a unified definition that covers most previous causality-based fairness notions, namely path-specific counterfactual fairness (PC fairness). Based on that, we propose a general method, in the form of a constrained optimization problem, for bounding path-specific counterfactual fairness in all unidentifiable situations. Experiments on synthetic and real-world datasets show the correctness and effectiveness of our method.
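The general shape of such a bounding formulation can be written schematically as follows; this is only a sketch of the idea, not the paper's exact program, and the symbol τ for the path-specific counterfactual quantity is assumed here:

```latex
% Schematic of the bounding approach: the unidentifiable quantity \tau
% depends on the unobserved exogenous distribution P(U); optimizing over all
% P(U) consistent with the observed data yields lower and upper bounds.
\tau_{\min} \;=\; \min_{P(U)} \tau\big(P(U)\big), \qquad
\tau_{\max} \;=\; \max_{P(U)} \tau\big(P(U)\big),
\quad \text{s.t. } P(U) \text{ induces the observed distribution } P(V).
```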
  5. The use of machine learning models in high-stakes applications (e.g., healthcare, lending, college admission) has raised growing concerns due to potential biases against protected social groups. Various fairness notions and methods have been proposed to mitigate such biases. In this work, we focus on Counterfactual Fairness (CF), a fairness notion that depends on an underlying causal graph and was first proposed by Kusner et al. (2017); it requires that the outcome an individual perceives be the same in the real world as it would be in a "counterfactual" world, in which the individual belongs to another social group. Learning fair models satisfying CF can be challenging. It was shown in Kusner et al. (2017) that a sufficient condition for satisfying CF is to not use features that are descendants of sensitive attributes in the causal graph. This implies a simple method that learns CF models using only non-descendants of sensitive attributes while eliminating all descendants (see the sketch below). Although several subsequent works proposed methods that use all features for training CF models, there is no theoretical guarantee that they can satisfy CF. In contrast, this work proposes a new algorithm that trains models using all the available features. We theoretically and empirically show that models trained with this method can satisfy CF.
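The sufficient condition from Kusner et al. (2017) mentioned above suggests a simple baseline: train only on features that are not causal descendants of the sensitive attribute. A minimal sketch follows; the toy causal graph, column names, and choice of classifier are placeholders, not the paper's proposed algorithm (which instead uses all features):

```python
import networkx as nx
from sklearn.linear_model import LogisticRegression

# Toy causal graph: A -> X1 -> X2, while X3 is not affected by A.
graph = nx.DiGraph([("A", "X1"), ("X1", "X2"), ("X3", "Y")])
descendants_of_A = nx.descendants(graph, "A")  # {"X1", "X2"}

def fit_non_descendant_model(df, features, label="Y", sensitive="A"):
    """Keep only features outside descendants(A) (and drop A itself), then
    fit a standard classifier; by construction its output cannot change
    under interventions on the sensitive attribute."""
    safe = [f for f in features if f not in descendants_of_A and f != sensitive]
    model = LogisticRegression().fit(df[safe], df[label])
    return model, safe
```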