NSF PAR Search | NSF Public Access Repository


Sorting through the noise: Testing robustness of information processing in pre-trained language models

https://doi.org/10.18653/v1/2021.emnlp-main.119

Pandia, L.; Ettinger, A. (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

On the Interplay Between Fine-tuning and Composition in Transformers

Yu, Lang; Ettinger, Allyson (January 2021, Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021)
Pragmatic competence of pre-trained language models through the lens of discourse connectives

https://doi.org/10.18653/v1/2021.conll-1.29

Pandia, L.; Cong, Y.; Ettinger, A. (January 2021, Proceedings of the 25th Conference on Computational Natural Language Learning (CoNLL))

Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words

Klafka, Josef; Ettinger, Allyson (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Although models using contextual word embeddings have achieved state-of-the-art results on a host of NLP tasks, little is known about exactly what information these embeddings encode about the context words that they are understood to reflect. To address this question, we introduce a suite of probing tasks that enable fine-grained testing of contextual embeddings for encoding of information about surrounding words. We apply these tasks to examine the popular BERT, ELMo and GPT contextual encoders, and find that each of our tested information types is indeed encoded as contextual information across tokens, often with near-perfect recoverability -- but the encoders vary in which features they distribute to which tokens, how nuanced their distributions are, and how robust the encoding of each feature is to distance. We discuss implications of these results for how different types of models break down and prioritize word-level context information when constructing token embeddings.
PeTra: A Sparsely Supervised Memory Model for People Tracking

https://doi.org/10.18653/v1/2020.acl-main.481

Toshniwal, Shubham; Ettinger, Allyson; Gimpel, Kevin; Livescu, Karen (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

We propose PeTra, a memory-augmented neural network designed to track entities in its memory slots. PeTra is trained using sparse annotation from the GAP pronoun resolution dataset and outperforms a prior memory model on the task while using a simpler architecture. We empirically compare key modeling choices, finding that we can simplify several aspects of the design of the memory module while retaining strong performance. To measure the people tracking capability of memory models, we (a) propose a new diagnostic evaluation based on counting the number of unique entities in text, and (b) conduct a small scale human evaluation to compare evidence of people tracking in the memory logs of PeTra relative to a previous approach. PeTra is highly effective in both evaluations, demonstrating its ability to track people in its memory despite being trained with limited annotation.
Assessing Phrasal Representation and Composition in Transformers

Yu, Lang; Ettinger, Allyson (January 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing)
Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks

Toshniwal, Shubham; Wiseman, Sam; Ettinger, Allyson; Livescu, Karen; Gimpel, Kevin (January 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing)