-
Abstract: This paper presents a new approach to improve static program analysis using Large Language Models (LLMs). The approach interleaves calls to the static analyzer and queries to the LLM. The query to the LLM is constructed based on intermediate results from the static analysis, and subsequent static analysis uses the results from the LLM query. We apply our approach to the problem of error-specification inference: given systems code written in C, infer the set of values that each function can return on error. Such error specifications aid in program understanding and can be used to find error-handling bugs. We implemented our approach by incorporating LLMs into EESI, the state-of-the-art static analysis for error-specification inference. Compared to EESI, our approach achieves higher recall (from an average of 52.55% to 77.83%) and higher F1-score (from an average of 0.612 to 0.804) while maintaining precision (from an average of 86.67% to 85.12%) on real-world benchmarks such as Apache HTTPD and MbedTLS. We also conducted experiments to understand the sources of imprecision in our LLM-assisted analysis, as well as the impact of LLM nondeterminism on the analysis results.
Free, publicly accessible full text available April 1, 2026.
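For concreteness, the sketch below illustrates the interleaving the abstract describes: each static-analysis pass is seeded with all error specifications known so far, the LLM is queried about functions the analysis leaves unresolved, and its answers feed the next pass until a fixpoint is reached. All names and signatures here (infer_error_specs, analyze, unresolved, ask_llm) are hypothetical illustrations, not EESI's actual interface.

    from typing import Callable, Dict, List, Set

    # Hypothetical sketch of the interleaved loop described above; these
    # names and signatures are illustrative, not EESI's actual API.
    ErrorSpec = Set[int]  # values a function may return on error, e.g. {-1}

    def infer_error_specs(
        analyze: Callable[[Dict[str, ErrorSpec]], Dict[str, ErrorSpec]],
        unresolved: Callable[[Dict[str, ErrorSpec]], List[str]],
        ask_llm: Callable[[str], ErrorSpec],
    ) -> Dict[str, ErrorSpec]:
        """Alternate static-analysis passes with LLM queries until the
        inferred error specifications stop changing (a fixpoint)."""
        specs: Dict[str, ErrorSpec] = {}
        while True:
            # 1. Run the static analysis, seeded with everything known so far.
            specs.update(analyze(specs))
            # 2. Query the LLM about each function the analysis left
            #    unresolved; any new answer feeds back into the next pass.
            progress = False
            for fn in unresolved(specs):
                llm_spec = ask_llm(fn)
                if llm_spec and llm_spec != specs.get(fn):
                    specs[fn] = llm_spec
                    progress = True
            if not progress:
                return specs

A real implementation would additionally have to parse free-form LLM answers into concrete value sets and cope with nondeterministic responses, two of the issues the abstract's experiments examine.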
-
Abstract: Source code is a form of human communication, albeit one where the information shared between the programmers reading and writing the code is constrained by the requirement that the code executes correctly. Programming languages are more syntactically constrained than natural languages, but they are also very expressive, allowing a great many different ways to express even very simple computations. Still, code written by developers is highly predictable, and many programming tools have taken advantage of this phenomenon, relying on language model surprisal as a guiding mechanism. While surprisal has been validated as a measure of cognitive load in natural language, its relation to human cognitive processes in code is still poorly understood. In this paper, we explore the relationship between surprisal and programmer preference at a small granularity: do programmers prefer more predictable expressions in code? Using meaning-preserving transformations, we produce equivalent alternatives to developer-written code expressions and run a corpus study on Java and Python projects. In general, language models rate the code expressions developers choose to write as more predictable than these transformed alternatives. Then, we perform two human-subject studies asking participants to choose between two equivalent snippets of Java code with different surprisal scores (one original, one transformed). We find that programmers do prefer more predictable variants, and that stronger language models such as the transformer align more often and more consistently with these preferences.
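As background, a token's surprisal under a language model is -log2 P(token | preceding context), and an expression is typically scored by the mean surprisal of its tokens; lower scores mean more predictable code. The sketch below shows the mechanics of such a comparison on two meaning-equivalent expressions. The toy unigram model is only a stand-in for the n-gram and transformer models the study uses, and the token counts are invented for illustration.

    import math
    from typing import Callable, Sequence

    def mean_surprisal(
        tokens: Sequence[str],
        prob: Callable[[Sequence[str], str], float],
    ) -> float:
        """Mean surprisal: -(1/n) * sum of log2 P(t_i | t_1..t_{i-1}).
        Lower values mean the model finds the code more predictable."""
        bits = sum(-math.log2(prob(tokens[:i], t)) for i, t in enumerate(tokens))
        return bits / len(tokens)

    # Toy unigram model with made-up counts; it ignores the prefix, unlike
    # the n-gram and transformer models used in the study.
    counts = {"i": 50, "+=": 30, "1": 40, "=": 35, "+": 25, ";": 60}
    total = sum(counts.values())

    def unigram(prefix: Sequence[str], tok: str) -> float:
        return counts.get(tok, 1) / total

    original = ["i", "+=", "1", ";"]               # developer-written form
    transformed = ["i", "=", "i", "+", "1", ";"]   # meaning-preserving rewrite
    print(mean_surprisal(original, unigram))       # ~2.46 bits/token
    print(mean_surprisal(transformed, unigram))    # ~2.53 bits/token

Under this toy model the developer-written form scores slightly lower, mirroring the corpus finding that models rate the expressions developers actually write as more predictable than the transformed alternatives.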
