NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Testing Causal Models of Word Meaning in LLMs

Musker, Samuel; Pavlick, Ellie (July 2024, Proceedings of the Annual Meeting of the Cognitive Science Society Volume 46)

Large Language Models (LLMs) have driven extraordinary improvements in NLP. However, it is unclear how such models represent lexical concepts-i.e., the meanings of the words they use. We evaluate the lexical representations of GPT-4, GPT-3, and Falcon-40B through the lens of HIPE theory, a concept representation theory focused on words describing artifacts (such as ‚Äúmop‚Äù, ‚Äúpencil‚Äù, and ‚Äúwhistle‚Äù). The theory posits a causal graph relating the meanings of such words to the form, use, and history of the referred objects. We test LLMs with the stimuli used by Chaigneau et al. (2004) on human subjects, and consider a variety of prompt designs. Our experiments concern judgements about causal outcomes, object function, and object naming. We do not find clear evidence that GPT-3 or Falcon-40B encode HIPE's causal structure, but find evidence that GPT-4 does. The results contribute to a growing body of research characterizing the representational capacity of LLMs.
more » « less
Full Text Available
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization

Zhang, Ruochen; Eickhoff, Carsten (May 2024, 2024 ELRA Language Resource Association:)

Cross-lingual summarization (CLS) has attracted increasing interest in recent years due to the availability of large-scale web-mined datasets and the advancements of multilingual language models. However, given the rareness of naturally occurring CLS resources, the majority of datasets are forced to rely on translation which can contain overly literal artifacts. This restricts our ability to observe naturally occurring CLS pairs that capture organic diction, including instances of code-switching. This alteration between languages in mid-message is a common phenomenon in multilingual settings yet has been largely overlooked in cross-lingual contexts due to data scarcity. To address this gap, we introduce CroCoSum, a dataset of cross-lingual code-switched summarization of technology news. It consists of over 24,000 English source articles and 18,000 human-written Chinese news summaries, with more than 92% of the summaries containing code-switched phrases. For reference, we evaluate the performance of existing approaches including pipeline, end-to-end, and zero-shot methods. We show that leveraging existing CLS resources as a pretraining step does not improve performance on CroCoSum, indicating the limited generalizability of current datasets. Finally, we discuss the challenges of evaluating cross-lingual summarizers on code-switched generation through qualitative error analyses.
more » « less
Full Text Available
Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

Lewis, Martha; Nayak, Nihal V; Yu, Peilin; Yu, Qinan; Merullo, Jack; Bach, Stephen H; Pavlick, Ellie (March 2024, Findings of the Association for Computational Linguistics: EACL 2024)

Large-scale neural network models combining text and images have made incredible progress in recent years. However, it remains an open question to what extent such models encode compositional representations of the concepts over which they operate, such as correctly identifying red cube by reasoning over the constituents red and cube. In this work, we focus on the ability of a large pretrained vision and language model (CLIP) to encode compositional concepts and to bind variables in a structure-sensitive way (e.g., differentiating cube behind sphere from sphere behind cube). To inspect the performance of CLIP, we compare several architectures from research on compositional distributional semantics models (CDSMs), a line of research that attempts to implement traditional compositional linguistic structures within embedding spaces. We benchmark them on three synthetic datasets– singleobject, two-object, and relational– designed to test concept binding. We find that CLIP can compose concepts in a single-object setting, but in situations where concept binding is needed, performance drops dramatically. At the same time, CDSMs also perform poorly, with best performance at chance level.
more » « less
Full Text Available
CIRCUIT COMPONENT REUSE ACROSS TASKS IN TRANSFORMER LANGUAGE MODELS

Merullo, Jack; Eickhoff, Carsten; Pavlick, Ellie (January 2024, The Twelfth International Conference on Learning Representations)

Recent work in mechanistic interpretability has shown that behaviors in language models can be successfully reverse-engineered through circuit analysis. A com- mon criticism, however, is that each circuit is task-specific, and thus such analysis cannot contribute to understanding the models at a higher level. In this work, we present evidence that insights (both low-level findings about specific heads and higher-level findings about general algorithms) can indeed generalize across tasks. Specifically, we study the circuit discovered in Wang et al. (2022) for the Indirect Object Identification (IOI) task and 1.) show that it reproduces on a larger GPT2 model, and 2.) that it is mostly reused to solve a seemingly different task: Colored Objects (Ippolito & Callison-Burch, 2023). We provide evidence that the process underlying both tasks is functionally very similar, and contains about a 78% overlap in in-circuit attention heads. We further present a proof-of-concept intervention experiment, in which we adjust four attention heads in middle layers in order to ‘repair’ the Colored Objects circuit and make it behave like the IOI circuit. In doing so, we boost accuracy from 49.6% to 93.7% on the Colored Ob- jects task and explain most sources of error. The intervention affects downstream attention heads in specific ways predicted by their interactions in the IOI circuit, indicating that this subcircuit behavior is invariant to the different task inputs. Overall, our results provide evidence that it may yet be possible to explain large language models’ behavior in terms of a relatively small number of interpretable task-general algorithmic building blocks and computational components
more » « less
Full Text Available
Language Models Implement Simple Word2Vec-style Vector Arithmetic

https://doi.org/10.18653/v1/2024.naacl-long.281

Merullo, Jack; Eickhoff, Carsten; Pavlick, Ellie (January 2024, Association for Computational Linguistics)

Full Text Available
Are Language Models Worse than Humans at Following Prompts? It’s Complicated

https://doi.org/10.18653/v1/2023.findings-emnlp.514

Webson, Albert; Loo, Alyssa; Yu, Qinan; Pavlick, Ellie (January 2023, Association for Computational Linguistics)

Full Text Available
Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors

https://doi.org/10.18653/v1/2023.emnlp-main.665

Zerveas, George; Rekabsaz, Navid; Eickhoff, Carsten (January 2023, Association for Computational Linguistics)

Full Text Available
Mitigating Bias in Search Results through Set-based Document Reranking and Neutrality Regularization

George Zerveas, Navid Rekabsaz (July 2022, ACM SIGIR 2022)

Full Text Available
Inconsistent Ranking Assumptions in Medical Search and Their Downstream Consequences

Daniel Cohen, Kevin Du (July 2022, ACM SIGIR 2022)

Full Text Available
NEWTS: A Corpus for News Topic-Focused Summarization

Seyed Ali Bahrainian, Sheridan Feucht (May 2022, Findings of ACL 2022)

Full Text Available

« Prev Next »

Search for: All records