Towards Understanding Gender Bias in Relation Extraction
Recent developments in Neural Relation Extraction (NRE) have made significant strides towards Automated Knowledge Base Construction. While much attention has been dedicated to improvements in accuracy, to our knowledge there have been no attempts in the literature to evaluate the social biases exhibited by NRE systems. In this paper, we create WikiGenderBias, a distantly supervised dataset composed of over 45,000 sentences, with a 10% human-annotated test set, for the purpose of analyzing gender bias in relation extraction systems. We find that when extracting spouse-of and hypernym (i.e., occupation) relations, an NRE system performs differently depending on the gender of the target entity. However, no such disparity appears when extracting relations such as birthDate or birthPlace. We also analyze how existing bias mitigation techniques, such as name anonymization, word embedding debiasing, and data augmentation, affect the NRE system in terms of maintaining test performance and reducing bias. Unfortunately, because NRE models rely heavily on surface-level cues, we find that existing bias mitigation approaches have a negative effect on NRE performance. Our analysis lays the groundwork for future work on quantifying and mitigating bias in NRE.
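To make two of the mitigation strategies named in the abstract concrete, the sketch below shows simplified versions of name anonymization and counterfactual data augmentation applied to raw training sentences. The word-pair list, token/span format, and function names are illustrative assumptions, not the paper's actual pipeline.

```python
# Minimal sketch of name anonymization and counterfactual data augmentation
# (CDA); the word pairs and helper names below are illustrative assumptions.

# Gendered word pairs used for CDA.
GENDER_PAIRS = [("he", "she"), ("him", "her"), ("his", "her"),
                ("husband", "wife"), ("father", "mother"), ("son", "daughter")]
SWAP = {}
for a, b in GENDER_PAIRS:
    SWAP[a], SWAP[b] = b, a  # note: "her" is ambiguous (him/his); real CDA
                             # implementations disambiguate with POS tags

def anonymize_names(tokens, entity_spans):
    """Name anonymization: replace entity mentions with placeholders so the
    model cannot exploit gendered surface cues in names."""
    out = list(tokens)
    for i, (start, end) in enumerate(entity_spans):
        for j in range(start, end):
            out[j] = f"ENTITY_{i}"
    return out

def counterfactual_augment(tokens):
    """CDA: emit a gender-swapped copy of the sentence; training on both
    copies balances the gendered contexts each relation is seen in."""
    return [SWAP.get(tok.lower(), tok) for tok in tokens]

sentence = "She was married to John and worked as a nurse".split()
print(anonymize_names(sentence, entity_spans=[(4, 5)]))  # masks "John"
print(counterfactual_augment(sentence))                  # "she" -> "he", ...
```

Hard debiasing for word embeddings operates on the embedding matrix rather than on the training text, so it is omitted from this sketch.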
- Award ID(s): 1927554
- PAR ID: 10192194
- Date Published:
- Journal Name: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
- Page Range / eLocation ID: 2943 to 2953
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Knowledge graphs (KGs) are of great importance in various artificial intelligence systems, such as question answering, relation extraction, and recommendation. Nevertheless, most real-world KGs are highly incomplete, with many missing relations between entities. To discover new triples (i.e., head entity, relation, tail entity), many KG completion algorithms have been proposed in recent years. However, a vast majority of existing studies require a large number of training triples for each relation, which contradicts the fact that the frequency distribution of relations in KGs often follows a long-tail distribution, meaning a majority of relations have only very few triples. Meanwhile, since most existing large-scale KGs are constructed automatically by extracting information from crowd-sourced data using heuristic algorithms, plenty of errors can inevitably be incorporated due to the lack of human verification, which greatly reduces performance on KG completion. To tackle the aforementioned issues, in this paper we study a novel problem of error-aware few-shot KG completion and present a principled KG completion framework, REFORM. Specifically, we formulate the problem under the few-shot learning framework (a minimal sketch of this episode construction appears after this list), and our goal is to accumulate meta-knowledge across different meta-tasks and generalize the accumulated knowledge to the meta-test task for error-aware few-shot KG completion. To address the associated challenges resulting from insufficient training samples and inevitable errors, we propose three essential modules, namely a neighbor encoder, cross-relation aggregation, and error mitigation, in each meta-task. Extensive experiments on three widely used KG datasets demonstrate the superiority of the proposed framework REFORM over competitive baseline methods.
- To recognize and mitigate the harms of generative AI systems, it is crucial to consider who is represented in the outputs of generative AI systems and how people are represented. A critical gap emerges when naively improving who is represented, as this does not imply that bias mitigation efforts have been applied to address how people are represented. We critically examined this by investigating gender representation in occupations across state-of-the-art large language models. We first show evidence suggesting that over time there have been interventions to models altering the resulting gender distribution, and we find that women are more represented than men when models are prompted to generate biographies or personas. We then demonstrate that representational biases persist in how different genders are represented by examining statistically significant word differences across genders (a minimal sketch of this word-difference analysis appears after this list). This results in a proliferation of representational harms, stereotypes, and neoliberal ideals that, despite existing interventions to increase female representation, reinforce existing systems of oppression.
- Mixed-initiative visual analytics systems incorporate well-established design principles that improve users' abilities to solve problems. As these systems consider whether to take initiative towards achieving user goals, many current systems address the potential for cognitive bias in human initiatives statically, relying on fixed initiatives they can take instead of identifying, communicating, and addressing the bias as it occurs. We argue that mixed-initiative design principles can and should incorporate cognitive bias mitigation strategies directly, through mitigation techniques embedded in the system to address cognitive biases in situ. We identify domain experts in machine learning adopting visual analytics techniques and systems that incorporate existing mixed-initiative principles, and examine their potential to support bias mitigation strategies. This examination considers the unique perspective these experts bring to visual analytics and is situated in existing user-centered systems that make exemplary use of design principles informed by cognitive theory. We then suggest informed opportunities for domain experts to take initiative toward addressing cognitive biases in light of their existing contributions to the field. Finally, we contribute open questions and research directions for designers seeking to adopt visual analytics techniques that incorporate bias-aware initiatives in future systems.
- Facial attribute classification algorithms frequently manifest demographic biases by obtaining differential performance across gender and racial groups. Existing bias mitigation techniques are mostly in-processing techniques, i.e., implemented during the classifier's training stage, that often lack generalizability, require demographically annotated training sets, and exhibit a trade-off between fairness and classification accuracy. In this paper, we propose a technique to mitigate bias at test time, i.e., during the deployment stage, by harnessing prediction uncertainty and human–machine partnership. To this end, we propose to utilize the small percentage of test samples identified as outliers with the highest prediction uncertainty (a minimal sketch of this selection step appears after this list). These uncertain samples are labeled by human analysts for decision rendering and for subsequently retraining the deep neural network in a continual learning framework. With minimal human involvement and through iterative refinement of the network with human guidance at test time, we seek to enhance the accuracy as well as the fairness of already deployed facial attribute classification algorithms. Extensive experiments are conducted on gender and smile attribute classification tasks using four publicly available datasets, with gender and race as the protected attributes. The obtained outcomes consistently demonstrate improved accuracy by up to 2% and 5% for the gender and smile attribute classification tasks, respectively, using our proposed approaches. Further, demographic bias was significantly reduced, outperforming state-of-the-art (SOTA) bias mitigation and baseline techniques by up to 55% for both classification tasks.
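For the few-shot KG completion setting described in the first entry above, the following is a minimal sketch of how meta-tasks (episodes) over long-tail relations might be constructed. The sampling scheme, support/query sizes, and function names are illustrative assumptions, not REFORM's actual implementation.

```python
# Minimal sketch of few-shot episode construction for KG completion;
# the sampling scheme and sizes are illustrative assumptions.
import random
from collections import defaultdict

def build_relation_index(triples):
    """Group (head, relation, tail) triples by relation; long-tail relations
    end up with only a handful of (head, tail) pairs each."""
    by_rel = defaultdict(list)
    for h, r, t in triples:
        by_rel[r].append((h, t))
    return by_rel

def sample_episode(by_rel, k_support=3, n_query=2):
    """Sample one meta-task: a relation, a small support set to learn from,
    and a query set to evaluate generalization on."""
    eligible = [r for r, pairs in by_rel.items()
                if len(pairs) >= k_support + n_query]
    rel = random.choice(eligible)
    pairs = random.sample(by_rel[rel], k_support + n_query)
    return rel, pairs[:k_support], pairs[k_support:]

triples = [("a1", "spouse", "b1"), ("a2", "spouse", "b2"),
           ("a3", "spouse", "b3"), ("a4", "spouse", "b4"),
           ("a5", "spouse", "b5"), ("c1", "birthPlace", "d1")]
rel, support, query = sample_episode(build_relation_index(triples))
print(rel, support, query)  # "spouse" episode: 3 support pairs, 2 query pairs
```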
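The second entry above surfaces statistically significant word differences across genders in generated biographies. One standard way to rank such differences is a smoothed log-odds ratio over word counts, sketched below; the toy corpora, prior, and ranking are illustrative assumptions rather than that paper's exact statistic.

```python
# Minimal sketch of a smoothed log-odds word-difference analysis;
# the toy corpora and prior are illustrative assumptions.
import math
from collections import Counter

def log_odds(counts_a, counts_b, alpha=0.5):
    """Smoothed log-odds ratio per word: large positive scores mark words
    over-represented in corpus A, large negative in corpus B."""
    total_a, total_b = sum(counts_a.values()), sum(counts_b.values())
    vocab = set(counts_a) | set(counts_b)
    scores = {}
    for w in vocab:
        pa = (counts_a[w] + alpha) / (total_a + alpha * len(vocab))
        pb = (counts_b[w] + alpha) / (total_b + alpha * len(vocab))
        scores[w] = math.log(pa / (1 - pa)) - math.log(pb / (1 - pb))
    return scores

bios_women = Counter("she worked as a nurse and a teacher".split())
bios_men = Counter("he worked as an engineer and a doctor".split())
ranked = sorted(log_odds(bios_women, bios_men).items(),
                key=lambda x: -abs(x[1]))
for w, s in ranked[:5]:
    print(f"{w:10s} {s:+.2f}")  # most gender-skewed words first
```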
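Finally, the uncertainty-based test-time selection described in the last entry can be sketched as follows. Predictive entropy as the uncertainty measure and a fixed 5% budget are illustrative assumptions, not that paper's exact criterion.

```python
# Minimal sketch of uncertainty-based test-time sample selection;
# entropy and the budget fraction are illustrative assumptions.
import numpy as np

def predictive_entropy(probs):
    """Uncertainty of each prediction from its class-probability vector."""
    return -np.sum(probs * np.log(probs + 1e-12), axis=1)

def select_uncertain(probs, budget=0.05):
    """Return indices of the top `budget` fraction of most uncertain test
    samples, to be routed to human analysts and later used for retraining."""
    entropy = predictive_entropy(probs)
    k = max(1, int(budget * len(probs)))
    return np.argsort(-entropy)[:k]

probs = np.array([[0.98, 0.02], [0.55, 0.45], [0.90, 0.10], [0.51, 0.49]])
print(select_uncertain(probs, budget=0.5))  # indices of the 2 least confident
```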