NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Surprising Effectiveness of Test-Time Training for Few-Shot Learning

Akyürek, Ekin; Damani, Mehlu; Zweiger, Adam; Qiu, Linlu; Guo, Han; Pari, Jyothish; Kim, Yoon; Andreas, Jacob (July 2025, International Conference on Machine Learning)

Free, publicly-accessible full text available July 13, 2026
Eliciting Human Preferences with Language Models

Li, Belinda; Tamkin, Alex; Goodman, Noah; Andreas, Jacob (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Inspecting and Editing Knowledge Representations in Language Models

Hernandez, Evan; Li, Belinda; Andreas, Jacob (October 2024, Conference on Language Models)

Neural language models (LMs) represent facts about the world described by text. Sometimes these facts derive from training data (in most LMs, a representation of the word banana encodes the fact that bananas are fruits). Sometimes facts derive from input text itself (a representation of the sentence I poured out the bottle encodes the fact that the bottle became empty). We describe REMEDI, a method for learning to map statements in natural language to fact encodings in an LM’s internal representation system. REMEDI encodings can be used as knowledge editors: when added to LM hidden representations, they modify downstream generation to be consistent with new facts. REMEDI encodings may also be used as probes: when compared to LM representations, they reveal which properties LMs already attribute to mentioned entities, in some cases making it possible to predict when LMs will generate outputs that conflict with background knowledge or input text. REMEDI thus links work on probing, prompting, and LM editing, and offers steps toward general tools for fine-grained inspection and control of knowledge in LMs.
more » « less
Full Text Available
An Incomplete Loop: Instruction Inference, Instruction Following, and In-context Learning in Language Models

Liu, Emmy; Neubig, Graham; Andreas, Jacob (October 2024, Conference on Language Modeling)

Full Text Available
Unforgettable Generalization in Language Models

Zhang, Eric; Choshen, Leshem; Andreas, Jacob (October 2024, Conference on Language Modeling)

Full Text Available
Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions

Ross, Alexis; Andreas, Jacob (August 2024, Proceedings of the Annual Meeting of the Association for Computational Linguistics)
Deductive Closure Training of Language Models for Coherence, Accuracy and Updatability

Akyürek, Afra Feyza; Akyürek, Ekin; Choshen, Leshem; Wijaya, Derry; Andreas, Jacob (August 2024, Findings of the Association for Computational Linguistics)
LILO: Learning interpretable libraries by compressing and documenting code

Grand, Gabriel; Wong, Lionel; Bowers, Maddy; Olausson, Theo; Liu, Muxin; Tenenbaum, Joshua; Andreas, Jacob (May 2024, International Conference on Learning Representations)
Learning with Language-Guided State Abstractions

Peng, Andi; Sucholutsky, Ilia; Li, Belinda; Sumers, Theodore; Griffiths, Thomas; Andreas, Jacob; Shah, Julie (May 2024, International Conference on Learning Representations)
Modeling boundedly rational agents with latent inference budgets

Jacob, Athul Paul; Gupta, Abhishek; Andreas, Jacob (May 2024, International Conference on Learning Representations)

We study the problem of modeling a population of agents pursuing unknown goals subject to unknown computational constraints. In standard models of bounded rationality, sub-optimal decision-making is simulated by adding homoscedastic noise to optimal decisions rather than actually simulating constrained inference. In this work, we introduce a latent inference budget model (L-IBM) that models these constraints explicitly, via a latent variable (inferred jointly with a model of agents’ goals) that controls the runtime of an iterative inference algorithm. L-IBMs make it possible to learn agent models using data from diverse populations of suboptimal actors. In three modeling tasks—inferring navigation goals from routes, inferring communicative intents from human utterances, and predicting next moves in human chess games—we show that L-IBMs match or outperforms Boltzmann models of decision-making under uncertainty. Moreover, the inferred inference budgets are themselves meaningful, efficient to compute, and correlated with measures of player skill, partner skill and task difficulty.
more » « less
Full Text Available

Search for: All records