NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Surprising Effectiveness of Test-Time Training for Few-Shot Learning

Akyürek, Ekin; Damani, Mehlu; Zweiger, Adam; Qiu, Linlu; Guo, Han; Pari, Jyothish; Kim, Yoon; Andreas, Jacob (July 2025, International Conference on Machine Learning)

Free, publicly-accessible full text available July 13, 2026
Eliciting Human Preferences with Language Models

Li, Belinda; Tamkin, Alex; Goodman, Noah; Andreas, Jacob (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Learning how hard to think: Input-adaptive allocation of LM computation

Damani, Mehul; Shenfeld, Idan; Peng, Andi; Bobu, Andreea; Andreas, Jacob (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Poddar, Sriyash; Wan, Yanming; Ivison, Hamish; Gupta, Abhishek; Jaques, Natasha (December 2024, Neural Information Processing Systems 2024)

Reinforcement Learning from Human Feedback (RLHF) is a powerful paradigm for aligning foundation models to human values and preferences. However, current RLHF techniques cannot account for the naturally occurring differences in individual human preferences across a diverse population. When these differences arise, traditional RLHF frameworks simply average over them, leading to inaccurate rewards and poor performance for individual subgroups. To address the need for pluralistic alignment, we develop a class of multimodal RLHF methods. Our proposed techniques are based on a latent variable formulation - inferring a novel user-specific latent and learning reward models and policies conditioned on this latent without additional user-specific data. While conceptually simple, we show that in practice, this reward modeling requires careful algorithmic considerations around model architecture and reward scaling. To empirically validate our proposed technique, we first show that it can provide a way to combat under- specification in simulated control problems, inferring and optimizing user-specific reward functions. Next, we conduct experiments on pluralistic language datasets representing diverse user preferences and demonstrate improved reward function accuracy. We additionally show the benefits of this probabilistic framework in terms of measuring uncertainty, and actively learning user preferences. This work enables learning from diverse populations of users with divergent preferences, an important challenge that naturally occurs in problems from robot learning to foundation model alignment.
more » « less
Full Text Available
Learning to Cooperate with Humans using Generative Agents

Liang, Yancheng; Chen, Daphne; Gupta, Abhishek; Du, Simon; Jaques, Natasha (December 2024, Neural Information Processing Systems 2024)

Training agents that can coordinate zero-shot with humans is a key mission in multi-agent reinforcement learning (MARL). Current algorithms focus on training simulated human partner policies which are then used to train a Cooperator agent. The simulated human is produced either through behavior cloning over a dataset of human cooperation behavior, or by using MARL to create a population of simulated agents. However, these approaches often struggle to produce a Cooperator that can coordinate well with real humans, since the simulated humans fail to cover the diverse strategies and styles employed by people in the real world. We show \emph{learning a generative model of human partners} can effectively address this issue. Our model learns a latent variable representation of the human that can be regarded as encoding the human's unique strategy, intention, experience, or style. This generative model can be flexibly trained from any (human or neural policy) agent interaction data. By sampling from the latent space, we can use the generative model to produce different partners to train Cooperator agents. We evaluate our method -- \textbf{G}enerative \textbf{A}gent \textbf{M}odeling for \textbf{M}ulti-agent \textbf{A}daptation (GAMMA) -- on Overcooked, a challenging cooperative cooking game that has become a standard benchmark for zero-shot coordination. We conduct an evaluation with real human teammates, and the results show that GAMMA consistently improves performance, whether the generative model is trained on simulated populations or human datasets. Further, we propose a method for posterior sampling from the generative model that is biased towards the human data, enabling us to efficiently improve performance with only a small amount of expensive human interaction data.
more » « less
Full Text Available
Adaptive Language-Guided Abstraction from Contrastive Explanations

Peng, Andi; Li, Belinda; Sucholutsky, Ilia; Kumar, Nishanth; Shah, Julie; Andreas, Jacob; Bobu, Andreea (November 2024, Conference on Robot Learning)

Full Text Available
Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions

Ross, Alexis; Andreas, Jacob (August 2024, Proceedings of the Annual Meeting of the Association for Computational Linguistics)
Learning Phonotactics from Linguistic Informants

Breiss, Canaan; Ross, Alexis; Maina-Kilaas, Amani; Levy, Roger; Andreas, Jacob (June 2024, Society for Computation in Linguistics)

We propose an interactive approach to language learning that utilizes linguistic acceptability judgments from an informant (a competent lan- guage user) to learn a grammar. Given a gram- mar formalism and a framework for synthesiz- ing data, our model iteratively selects or synthe- sizes a data-point according to one of a range of information-theoretic policies, asks the in- formant for a binary judgment, and updates its own parameters in preparation for the next query. We demonstrate the effectiveness of our model in the domain of phonotactics, the rules governing what kinds of sound-sequences are acceptable in a language, and carry out two experiments, one with typologically-natural linguistic data and another with a range of procedurally-generated languages. We find that the information-theoretic policies that our model uses to select items to query the infor- mant achieve sample efficiency comparable to, and sometimes greater than, fully supervised approaches.
more » « less
Full Text Available
The consensus game: Language model generation via equilibrium search

Jacob, Athul Paul; Shen, Yikang; Farina, Gabriele; Andreas, Jacob (May 2024, International Conference on Learning Representations)

When applied to question answering and other text generation tasks, language models (LMs) may be queried generatively (by sampling answers from their output distribution) or discriminatively (by using them to score or rank a set of candidate outputs). These procedures sometimes yield very different predictions. How do we reconcile mutually incompatible scoring procedures to obtain coherent LM predictions? We introduce a new training-free, game-theoretic procedure for language model decoding. Our approach casts language model decoding as a regularized imperfect-information sequential signaling game—which we term the CONSENSUS GAME—in which a GENERATOR seeks to communicate an abstract correctness parameter using natural language sentences to a DISCRIMINATOR. We develop computational procedures for finding approximate equilibria of this game, resulting in a decoding algorithm we call EQUILIBRIUM-RANKING. Applied to a large number of tasks (including reading comprehension, commonsense reasoning, mathematical problem-solving, and dialog), EQUILIBRIUM-RANKING consistently, and sometimes substantially, improves performance over existing LM decoding procedures—on multiple benchmarks, we observe that applying EQUILIBRIUM- RANKING to LLaMA-7B outperforms the much larger LLaMA-65B and PaLM- 540B models. These results highlight the promise of game-theoretic tools for addressing fundamental challenges of truthfulness and consistency in LMs.
more » « less
Full Text Available
Learning adaptive planning representations with natural language guidance

Wong, Lionel; Mao, Jiayuan; Sharma, Pratyusha; Siegel, Zachary; Feng, Jiahai; Korneev, Noa; Tenenbaum, Joshua B; Andreas, Jacob (May 2024, International Conference on Learning Representations)

Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into smaller subproblems appropriate for a goal or set of goals. This paper describes Ada (Action Domain Acquisition), a framework for automatically constructing task-specific planning representations using task-general background knowledge from language models (LMs). Starting with a general-purpose hierarchical planner and a low-level goal-conditioned policy, Ada interactively learns a library of planner-compatible high-level action abstractions and low-level controllers adapted to a particular domain of planning tasks. On two language-guided interactive planning benchmarks (Mini Minecraft and ALFRED Household Tasks), Ada strongly outperforms other approaches that use LMs for sequential decision- making, offering more accurate plans and better generalization to complex tasks.
more » « less
Full Text Available

« Prev Next »

Search for: All records