NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Testing the Ability of Language Models to Interpret Figurative Language

https://doi.org/10.18653/v1/2022.naacl-main.330

Liu, Emmy; Cui, Chenxuan; Zheng, Kenneth; Neubig, Graham (July 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Figurative and metaphorical language are commonplace in discourse, and figurative expressions play an important role in communication and cognition. However, figurative language has been a relatively under-studied area in NLP, and it remains an open question to what extent modern language models can interpret nonliteral phrases. To address this question, we introduce Fig-QA, a Winograd-style nonliteral language understanding task consisting of correctly interpreting paired figurative phrases with divergent meanings. We evaluate the performance of several state-of-the-art language models on this task, and find that although language models achieve performance significantly over chance, they still fall short of human performance, particularly in zero- or few-shot settings. This suggests that further work is needed to improve the nonliteral reasoning capabilities of language models.
more » « less
Full Text Available
Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation

https://doi.org/10.18653/v1/2022.acl-long.61

Wang, Xinyi; Ruder, Sebastian; Neubig, Graham (January 2022, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

The performance of multilingual pretrained models is highly dependent on the availability of monolingual or parallel text present in a target language. Thus, the majority of the world’s languages cannot benefit from recent progress in NLP as they have no or limited textual data. To expand possibilities of using NLP technology in these under-represented languages, we systematically study strategies that relax the reliance on conventional language resources through the use of bilingual lexicons, an alternative resource with much better language coverage. We analyze different strategies to synthesize textual or labeled data using lexicons, and how this data can be combined with monolingual or parallel text when available. For 19 under-represented languages across 3 tasks, our methods lead to consistent improvements of up to 5 and 15 points with and without extra monolingual text respectively. Overall, our study highlights how NLP methods can be adapted to thousands more languages that are under-served by current technology.
more » « less
Full Text Available
Reducing Confusion in Active Learning for Part-Of-Speech Tagging

https://doi.org/10.1162/tacl_a_00350

Chaudhary, Aditi; Anastasopoulos, Antonios; Sheikh, Zaid; Neubig, Graham (February 2021, Transactions of the Association for Computational Linguistics)
null (Ed.)
Active learning (AL) uses a data selection algorithm to select useful training samples to minimize annotation cost. This is now an essential tool for building low-resource syntactic analyzers such as part-of-speech (POS) taggers. Existing AL heuristics are generally designed on the principle of selecting uncertain yet representative training instances, where annotating these instances may reduce a large number of errors. However, in an empirical study across six typologically diverse languages (German, Swedish, Galician, North Sami, Persian, and Ukrainian), we found the surprising result that even in an oracle scenario where we know the true uncertainty of predictions, these current heuristics are far from optimal. Based on this analysis, we pose the problem of AL as selecting instances that maximally reduce the confusion between particular pairs of output tags. Extensive experimentation on the aforementioned languages shows that our proposed AL strategy outperforms other AL strategies by a significant margin. We also present auxiliary results demonstrating the importance of proper calibration of models, which we ensure through cross-view training, and analysis demonstrating how our proposed strategy selects examples that more closely follow the oracle data distribution. The code is publicly released here. 1
more » « less
Full Text Available
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection

https://doi.org/10.18653/v1/2021.emnlp-main.553

Chaudhary, Aditi; Yin, Kayo; Anastasopoulos, Antonios; Neubig, Graham (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Explicit Alignment Objectives for Multilingual Bidirectional Encoders

https://doi.org/10.18653/v1/2021.naacl-main.284

Hu, Junjie; Johnson, Melvin; Firat, Orhan; Siddhant, Aditya; Neubig, Graham (January 2021, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Full Text Available
Evaluating the Morphosyntactic Well-formedness of Generated Texts

https://doi.org/10.18653/v1/2021.emnlp-main.570

Pratapa, Adithya; Anastasopoulos, Antonios; Rijhwani, Shruti; Chaudhary, Aditi; Mortensen, David R.; Neubig, Graham; Tsvetkov, Yulia (January 2021, Evaluating the Morphosyntactic Well-formedness of Generated Texts)

Text generation systems are ubiquitous in natural language processing applications. However, evaluation of these systems remains a challenge, especially in multilingual settings. In this paper, we propose L’AMBRE – a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. We present a way to automatically extract various rules governing morphosyntax directly from dependency treebanks. To tackle the noisy outputs from text generation systems, we propose a simple methodology to train robust parsers. We show the effectiveness of our metric on the task of machine translation through a diachronic study of systems translating into morphologically-rich languages.
more » « less
Full Text Available
Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties

Siminyu, Kathleen; Li, Xinjian; Anastasopoulos, Antonios; Mortensen, David R.; Marlo, Michael; Neubig, Graham (January 2021, 22nd Annual Conference of the International Speech Communication Association (InterSpeech 2021))
null (Ed.)
Full Text Available
Efficient Test Time Adapter Ensembling for Low-resource Language Varieties

https://doi.org/10.18653/v1/2021.findings-emnlp.63

Wang, Xinyi; Tsvetkov, Yulia; Ruder, Sebastian; Neubig, Graham (January 2021, Findings of the Association for Computational Linguistics: EMNLP 2021)

Full Text Available
Lexically-Aware Semi-Supervised Learning for OCR Post-Correction

Rijhwani, Shruti; Rosenblum, Daisy; Anastasopoulos, Antonios; Neubig, Graham (January 2021, Transactions of the Association for Computational Linguistics)
null (Ed.)
Full Text Available
The CMU-LTI submission to the SIGMORPHON 2020 Shared Task 0: Language-Specific Cross-Lingual Transfer

Murikinati, Nikitha; Anastasopoulos, Antonios (July 2020, Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology)

Full Text Available

« Prev Next »

Search for: All records