On the Vulnerability of Fairness Constrained Learning to Malicious Noise

Blum, Avrim; Okoroafor, Princewill; Saha, Aadirupa; Stangl, Kevin M

Citation Details

We consider the vulnerability of fairness-constrained learning to malicious noise in the training data. Konstantinov and Lampert (2021) initiated the study of this question and proved that any proper learner can exhibit high vulnerability when group sizes are imbalanced. Here, we present a more optimistic view, showing that if we allow randomized classifiers, then the landscape is much more nuanced. For example, for Demographic Parity we need only incur a Θ(α) loss in accuracy, where α is the malicious noise rate, matching the best possible even without fairness constraints. For Equal Opportunity, we show we can incur an O(sqrt(α)) loss, and give a matching Ω(sqrt(α)) lower bound. For Equalized Odds and Predictive Parity, however, and adversary can indeed force an Ω(1) loss. The key technical novelty of our work is how randomization can bypass simple 'tricks' an adversary can use to amplify its power. These results provide a more fine-grained view of the sensitivity of fairness-constrained learning to adversarial noise in training data. more »

Award ID(s):: 2212968

PAR ID:: 10511433

Author(s) / Creator(s):: Blum, Avrim; Okoroafor, Princewill; Saha, Aadirupa; Stangl, Kevin M

Publisher / Repository:: Proceedings of Machine Learning Research

Date Published:: 2024-05-02

Journal Name:: 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

Subject(s) / Keyword(s):: Fairness Adversarial machine learning

Format(s):: Medium: X

Location:: Valencia, Spain

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this