Socially Aware Bias Measurements for Hindi Language Representations

Malik, Vijit; Dev, Sunipa; Nishi, Akihiro; Peng, Nanyun; Chang, Kai-Wei

doi:10.18653/v1/2022.naacl-main.76

Citation Details

Socially Aware Bias Measurements for Hindi Language Representations

Language representations are an efficient tool used across NLP, but they are strife with encoded societal biases. These biases are studied extensively, but with a primary focus on English language representations and biases common in the context of Western society. In this work, we investigate the biases present in Hindi language representations such as caste and religion associated biases. We demonstrate how biases are unique to specific language representations based on the history and culture of the region they are widely spoken in, and also how the same societal bias (such as binary gender associated biases) when investigated across languages is encoded by different words and text spans. With this work, we emphasize on the necessity of social-awareness along with linguistic and grammatical artefacts when modeling language representations, in order to understand the biases encoded. more »

Award ID(s):: 1927554

PAR ID:: 10391939

Author(s) / Creator(s):: Malik, Vijit; Dev, Sunipa; Nishi, Akihiro; Peng, Nanyun; Chang, Kai-Wei

Date Published:: 2022-01-01

Journal Name:: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Page Range / eLocation ID:: 1041 to 1052

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2022.naacl-main.76

More Like this