NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Is Programming by Examples Solved by LLMs?

Li, Wen-Ding; Ellis, Kevin (December 2025, NeurIPS)

Free, publicly-accessible full text available December 10, 2026
Doing Experiments and Revising Rules With Natural Language and Probabilistic Reasoning

Piriyakulkij, Top Wasu; Langenfeld, Cassidy; Le, Tuan-Anh; Ellis, Kevin (December 2025, NeurIPS)

Free, publicly-accessible full text available December 10, 2026
Synthesizing theories of human language with Bayesian program induction

https://doi.org/10.1038/s41467-022-32012-w

Ellis, Kevin; Albright, Adam; Solar-Lezama, Armando; Tenenbaum, Joshua B.; O’Donnell, Timothy J. (December 2022, Nature Communications)

Abstract Automated, data-driven construction and evaluation of scientific models and theories is a long-standing challenge in artificial intelligence. We present a framework for algorithmically synthesizing models of a basic part of human language: morpho-phonology, the system that builds word forms from sounds. We integrate Bayesian inference with program synthesis and representations inspired by linguistic theory and cognitive models of learning and discovery. Across 70 datasets from 58 diverse languages, our system synthesizes human-interpretable models for core aspects of each language’s morpho-phonology, sometimes approaching models posited by human linguists. Joint inference across all 70 data sets automatically synthesizes a meta-model encoding interpretable cross-language typological tendencies. Finally, the same algorithm captures few-shot learning dynamics, acquiring new morphophonological rules from just one or a few examples. These results suggest routes to more powerful machine-enabled discovery of interpretable models in linguistics and other scientific domains.
more » « less
Full Text Available
Program Synthesis with Pragmatic Communication

Pu, Yewen; Ellis, Kevin; Kryven, Marta; Tenenbaum, Josh; Solar-Lezama, Armando (December 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Program Synthesis with Pragmatic Communication

Pu, Yewen; Ellis, Kevin; Kryven, Marta; Tenenbaum, Josh; Solar-Lezama, Armando (December 2020, Advances in neural information processing systems)

Full Text Available
Neurosymbolic Programming

https://doi.org/10.1561/2500000049

Chaudhuri, Swarat; Ellis, Kevin; Polozov, Oleksandr; Singh, Rishabh; Solar-Lezama, Armando; Yue, Yisong (January 2021, Foundations and Trends® in Programming Languages)

Full Text Available
Neurosymbolic Programming

https://doi.org/10.1561/9781680839357

Chaudhuri, Swarat; Ellis, Kevin; Polozov, Oleksandr; Singh, Rishabh; Solar-Lezama, Armando; Yue, Yisong (January 2021, Foundations and trends in programming languages)

Full Text Available
DreamCoder: bootstrapping inductive program synthesis with wake-sleep library learning

https://doi.org/10.1145/3453483.3454080

Ellis, Kevin; Wong, Catherine; Nye, Maxwell; Sablé-Meyer, Mathias; Morales, Lucas; Hewitt, Luke; Cary, Luc; Solar-Lezama, Armando; Tenenbaum, Joshua B. (June 2021, International Conference on Programming Language Design and Implementation)

Full Text Available
Top-Down Synthesis for Library Learning

https://doi.org/10.1145/3571234

Bowers, Matthew; Olausson, Theo_X; Wong, Lionel; Grand, Gabriel; Tenenbaum, Joshua_B; Ellis, Kevin; Solar-Lezama, Armando (January 2023, Proceedings of the ACM on Programming Languages)

This paper introducescorpus-guided top-down synthesisas a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm towards abstractions that maximally capture shared structures in the corpus. We present an implementation of the approach in a tool called Stitch and evaluate it against the state-of-the-art deductive library learning algorithm from DreamCoder. Our evaluation shows that Stitch is 3-4 orders of magnitude faster and uses 2 orders of magnitude less memory while maintaining comparable or better library quality (as measured by compressivity). We also demonstrate Stitch’s scalability on corpora containing hundreds of complex programs that are intractable with prior deductive approaches and show empirically that it is robust to terminating the search procedure early—further allowing it to scale to challenging datasets by means of early stopping.
more » « less
Learning to Infer Graphics Programs from Hand-Drawn Images

Ellis, Kevin; Ritchie, Daniel; Solar-Lezama, Armando; Tenenbaum, Josh (December 2018, Advances in neural information processing systems)

Full Text Available

Search for: All records