Robust Natural Language Understanding with Residual Attention Debiasing

Wang, Fei; Huang, James Y.; Yan, Tianyi; Zhou, Wenxuan; Chen, Muhao

doi:10.18653/v1/2023.findings-acl.32

Citation Details

Robust Natural Language Understanding with Residual Attention Debiasing

Natural language understanding (NLU) models often suffer from unintended dataset biases. Among bias mitigation methods, ensemble-based debiasing methods, especially product-of-experts (PoE), have stood out for their impressive empirical success. However, previous ensemble-based debiasing methods typically apply debiasing on top-level logits without directly addressing biased attention patterns. Attention serves as the main media of feature interaction and aggregation in PLMs and plays a crucial role in providing robust prediction. In this paper, we propose REsidual Attention Debiasing (READ), an end-to-end debiasing method that mitigates unintended biases from attention. Experiments on three NLU benchmarks show that READ significantly improves the OOD performance of BERT-based models, including +12.9% accuracy on HANS, +11.0% accuracy on FEVER-Symmetric, and +2.7% F1 on PAWS. Detailed analyses demonstrate the crucial role of unbiased attention in robust NLU models and that READ effectively mitigates biases in attention. more »

Award ID(s):: 2105329

PAR ID:: 10440672

Author(s) / Creator(s):: Wang, Fei; Huang, James Y.; Yan, Tianyi; Zhou, Wenxuan; Chen, Muhao

Date Published:: 2023-01-01

Journal Name:: Findings of the Association for Computational Linguistics: ACL 2023

Page Range / eLocation ID:: 504 to 519

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.findings-acl.32

More Like this