VERB: Visualizing and Interpreting Bias Mitigation Techniques Geometrically for Word Representations

Rathore, Archit; Dev, Sunipa; Phillips, Jeff M.; Srikumar, Vivek; Zheng, Yan; Yeh, Chin-Chia Michael; Wang, Junpeng; Zhang, Wei; Wang, Bei

doi:10.1145/3604433

Citation Details

VERB: Visualizing and Interpreting Bias Mitigation Techniques Geometrically for Word Representations

Word vector embeddings have been shown to contain and amplify biases in the data they are extracted from. Consequently, many techniques have been proposed to identify, mitigate, and attenuate these biases in word representations. In this paper, we utilize interactive visualization to increase the interpretability and accessibility of a collection of state-of-the-art debiasing techniques. To aid this, we present the Visualization of Embedding Representations for deBiasing (“VERB”) system, an open-source web-based visualization tool that helps users gain a technical understanding and visual intuition of the inner workings of debiasing techniques, with a focus on their geometric properties. In particular, VERB offers easy-to-follow examples that explore the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, VERB decomposes each technique into interpretable sequences of primitive transformations and highlights their effect on the word vectors using dimensionality reduction and interactive visual exploration. VERB is designed to target natural language processing (NLP) practitioners who are designing decision-making systems on top of word embeddings, and also researchers working with the fairness and ethics of machine learning systems in NLP. It can also serve as a visual medium for education, which helps an NLP novice understand and mitigate biases in word embeddings. more »

Award ID(s):: 2134223 1661375 2205418

PAR ID:: 10421447

Author(s) / Creator(s):: Rathore, Archit; Dev, Sunipa; Phillips, Jeff M.; Srikumar, Vivek; Zheng, Yan; Yeh, Chin-Chia Michael; Wang, Junpeng; Zhang, Wei; Wang, Bei

Date Published:: 2023-06-22

Journal Name:: ACM Transactions on Interactive Intelligent Systems

ISSN:: 2160-6455

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3604433

More Like this