Gender Slopes: Counterfactual Fairness for Computer Vision Models by Attribute Manipulation

Joo, Jungseock; Kärkkäinen, Kimmo

doi:10.1145/3422841.3423533

Citation Details

Gender Slopes: Counterfactual Fairness for Computer Vision Models by Attribute Manipulation

Automated computer vision systems have been applied in many domains including security, law enforcement, and personal devices, but recent reports suggest that these systems may produce biased results, discriminating against people in certain demographic groups. Diagnosing and understanding the underlying true causes of model biases, however, are challenging tasks because modern computer vision systems rely on complex black-box models whose behaviors are hard to decode. We propose to use an encoder-decoder network developed for image attribute manipulation to synthesize facial images varying in the dimensions of gender and race while keeping other signals intact. We use these synthesized images to measure counterfactual fairness of commercial computer vision classifiers by examining the degree to which these classifiers are affected by gender and racial cues controlled in the images, e.g., feminine faces may elicit higher scores for the concept of nurse and lower scores for STEM-related concepts. more »

Award ID(s):: 1831848

PAR ID:: 10299105

Author(s) / Creator(s):: Joo, Jungseock; Kärkkäinen, Kimmo

Date Published:: 2020-10-12

Journal Name:: FATE/MM '20: Proceedings of the 2nd International Workshop on Fairness, Accountability, Transparency and Ethics in Multimedia

Page Range / eLocation ID:: 1 to 5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3422841.3423533

More Like this