NSF PAR Search | NSF Public Access Repository

Resource-efficient Inference with Foundation Model Programs

Ni, Lunyiu; Ding, Zhimin; Yu, Kevin; Cheung, Marco; Jermaine, Christopher; Chaudhuri, Swarat (October 2025, Conference on Language Models (COLM) 2025)

Free, publicly-accessible full text available October 7, 2026
Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation

Jain, Abhinav; Chaudhuri, Swarat; Reps, Thomas; Jermaine, Christopher (December 2024, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024)

Full Text Available
Symbolic Regression with a Learned Concept Library

Grayeli, Arya; Sehgal, Atharva; Costilla, Omar; Cranmer, Miles; Chaudhuri, Swarat (December 2024, NeurIPS 2024)

We present a novel method for symbolic regression (SR), the task of searching for compact programmatic hypotheses that best explain a dataset. The problem is commonly solved using genetic algorithms; we show that we can enhance such methods by inducing a library of abstract textual concepts. Our algorithm, called LaSR, uses zero-shot queries to a large language model (LLM) to discover and evolve concepts occurring in known high-performing hypotheses. We discover new hypotheses using a mix of standard evolutionary steps and LLM-guided steps (obtained through zero-shot LLM queries) conditioned on discovered concepts. Once discovered, hypotheses are used in a new round of concept abstraction and evolution. We validate LaSR on the Feynman equations, a popular SR benchmark, as well as a set of synthetic tasks. On these benchmarks, LaSR substantially outperforms a variety of state-of-the-art SR approaches based on deep learning and evolutionary algorithms. Moreover, we show that LaSR can be used to discover a new and powerful scaling law for LLMs.
Full Text Available
Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation

Jain, Abhinav; Chaudhuri, Swarat; Reps, Thomas W; Jermaine, Christopher M (December 2024, http://papers.nips.cc/paper_files/paper/2024/hash/548551c07a68c8f0a87d67c6167cedb1-Abstract-Conference.html)

Parameter-Efficient Fine-Tuning (PEFT) has become the standard for customizing Foundation Models (FMs) to user-specific downstream tasks. However, typical PEFT methods require storing multiple task-specific adapters, creating scalability issues as these adapters must be housed and run at the FM server. Traditional prompt tuning offers a potential solution by customizing FMs through task-specific input prefixes, but it underperforms other PEFT methods such as LoRA. To address this gap, we propose Low-Rank Prompt Adaptation (LoPA), a prompt-tuning-based approach that performs on par with state-of-the-art PEFT methods and full fine-tuning while being more parameter-efficient and not requiring a server-based adapter. LoPA generates soft prompts by balancing between sharing task-specific information across instances and customization for each instance. It uses a low-rank decomposition of the soft-prompt component encoded for each instance to achieve parameter efficiency. We provide a comprehensive evaluation on multiple natural language understanding and code generation and understanding tasks across a wide range of foundation models with varying sizes.
Full Text Available
Symbolic Regression with a Learned Concept Library

Grayeli, Arya; Sehgal, Atharva; Costilla-Reyes, Omar; Cranmer, Miles; Chaudhuri, Swarat (December 2024, Annual Conference on Neural Information Processing Systems 2024 (NeurIPS 2024))

Full Text Available
An In-Context Learning Agent for Formal Theorem-Proving

Thakur, Amitayush; Tsoukalas, George; Wen, Yeming; Xin, Jimmy; Chaudhuri, Swarat (October 2024, Conference on Language Models (CoLM 2024))

Full Text Available
PUTNAMBENCH: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Tsoukalas, George; Lee, Jasper; Jennings, John; Xin, Jimmy; Ding, Michelle; Jennings, Michael; Thakur, Amitayush; Chaudhuri, Swarat (December 2024, Neural Information Processing Systems (NeurIPS), 2024)

We present PUTNAMBENCH, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PUTNAMBENCH consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalizations in Lean 4 and Isabelle; a substantial subset also has Coq formalizations. Proving the theorems requires significant problem-solving ability and proficiency in a broad range of topics taught in undergraduate mathematics courses. We use PUTNAMBENCH to evaluate several established neural and symbolic theorem-provers. These approaches can only solve a handful of the PUTNAMBENCH problems, establishing the benchmark as a difficult open challenge for research on neural theorem-proving. PUTNAMBENCH is available at https://github.com/trishullab/PutnamBench.
Full Text Available
PUTNAMBENCH: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Tsoukalas, George; Lee, Jasper; Jennings, John; Xin, Jimmy; Ding, Michelle; Jennings, Michael; Thakur, Amitayush; Chaudhuri, Swarat (December 2024, Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks track)

Full Text Available
PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Tsoukalas, George; Lee, Jasper; Jennings, John; Xin, Jimmy; Ding, Michelle; Jennings, Michael; Thakur, Amitayush; Chaudhuri, Swarat (December 2024, Neural Information Processing Systems)

We present PutnamBench, a new multi-language benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1692 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the problems have formalizations in Lean 4 and Isabelle; a substantial subset also has Coq formalizations. PutnamBench requires significant problem-solving ability and proficiency in a broad range of topics taught in undergraduate mathematics courses. We use PutnamBench to evaluate several established neural and symbolic theorem-provers. These approaches can only solve a handful of the PutnamBench problems, establishing the benchmark as a difficult open challenge for research on neural theorem-proving. PutnamBench is available at https://github.com/trishullab/PutnamBench.
Full Text Available
An In-Context Learning Agent for Formal Theorem-Proving

Thakur, Amitayush; Tsoukalas, George; Wen, Yeming; Xin, Jimmy; Chaudhuri, Swarat (October 2024, Conference on Language Models, 2024)

Full Text Available

