NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Control Large Language Models via Divide and Conquer

Li, Bingxuan; Wang, Yiwei; Meng, Tao; Chang, Kai-Wei; Peng, Nanyun (November 2024, The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP))

Full Text Available
On the Paradox of Learning to Reason from Data

Zhang, Honghua; Li, Liunian Harold; Meng, Tao; Chang, Kai-Wei; Van den Broeck, Guy (August 2023, Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI))

Full Text Available
On the Robustness of Language Encoders against Grammatical Errors

https://doi.org/10.18653/v1/2020.acl-main.310

Yin, Fan; Long, Quanyu; Meng, Tao; Chang, Kai-Wei (January 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

We conduct a thorough study to diagnose the behaviors of pre-trained language encoders (ELMo, BERT, and RoBERTa) when confronted with natural grammatical errors. Specifically, we collect real grammatical errors from non-native speakers and conduct adversarial attacks to simulate these errors on clean text data. We use this approach to facilitate debugging models on downstream applications. Results confirm that the performance of all tested models is affected but the degree of impact varies. To interpret model behaviors, we further design a linguistic acceptability task to reveal their abilities in identifying ungrammatical sentences and the position of errors. We find that fixed contextual encoders with a simple classifier trained on the prediction of sentence correctness are able to locate error positions. We also design a cloze test for BERT and discover that BERT captures the interaction between errors and specific tokens in context. Our results shed light on understanding the robustness and behaviors of language encoders against grammatical errors.
more » « less
Full Text Available
Mitigating Gender Bias Amplification in Distribution by Posterior Regularization

https://doi.org/10.18653/v1/2020.acl-main.264

Jia, Shengyu; Meng, Tao; Zhao, Jieyu; Chang, Kai-Wei (January 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Advanced machine learning techniques have boosted the performance of natural language processing. Nevertheless, recent studies, e.g., Zhao (2017) show that these techniques inadvertently capture the societal bias hidden in the corpus and further amplify it. However, their analysis is conducted only on models’ top predictions. In this paper, we investigate the gender bias amplification issue from the distribution perspective and demonstrate that the bias is amplified in the view of predicted probability distribution over labels. We further propose a bias mitigation approach based on posterior regularization. With little performance loss, our method can almost remove the bias amplification in the distribution. Our study sheds the light on understanding the bias amplification.
more » « less
Full Text Available
Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing

https://doi.org/10.18653/v1/D19-1103

Meng, Tao; Peng, Nanyun; Chang, Kai-Wei (January 2019, Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP))

Prior work on cross-lingual dependency parsing often focuses on capturing the commonalities between source and target languages and overlook the potential to leverage the linguistic properties of the target languages to facilitate the transfer. In this paper, we show that weak supervisions of linguistic knowledge for the target languages can improve a cross-lingual graph-based dependency parser substantially. Specifically, we explore several types of corpus linguistic statistics and compile them into corpus-statistics constraints to facilitate the inference procedure. We propose new algorithms that adapt two techniques, Lagrangian relaxation and posterior regularization, to conduct inference with corpus-statistics constraints. Experiments show that the Lagrangian relaxation and posterior regularization techniques improve the performances on 15 and 17 out of 19 target languages, respectively. The improvements are especially large for the target languages that have different word order features from the source language.
more » « less
Full Text Available

Search for: All records