Breaking Fair Binary Classification with Optimal Flipping Attacks

Jo, Changhun; Sohn, Jy-Yong; Lee, Kangwook

doi:10.1109/ISIT50566.2022.9834475

Citation Details

Breaking Fair Binary Classification with Optimal Flipping Attacks

Minimizing risk with fairness constraints is one of the popular approaches to learning a fair classifier. Recent works showed that this approach yields an unfair classifier if the training set is corrupted. In this work, we study the minimum amount of data corruption required for a successful flipping attack. First, we find lower/upper bounds on this quantity and show that these bounds are tight when the target model is the unique unconstrained risk minimizer. Second, we propose a computationally efficient data poisoning attack algorithm that can compromise the performance of fair learning algorithms. more »

Award ID(s):: 2003129

PAR ID:: 10395562

Author(s) / Creator(s):: Jo, Changhun; Sohn, Jy-Yong; Lee, Kangwook

Date Published:: 2022-06-26

Journal Name:: 2022 IEEE International Symposium on Information Theory (ISIT)

Page Range / eLocation ID:: 1453 to 1458

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ISIT50566.2022.9834475

More Like this