NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BatFix: Repairing language model-based transpilation

https://doi.org/10.1145/3658668

Ramos, Daniel; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Le_Goues, Claire (July 2024, ACM Transactions on Software Engineering and Methodology)

To keep up with changes in requirements, frameworks, and coding practices, software organizations might need to migrate code from one language to another. Source-to-source migration, or transpilation, is often a complex, manual process. Transpilation requires expertise both in the source and target language, making it highly laborious and costly. Languages models for code generation and transpilation are becoming increasingly popular. However, despite capturing code-structure well, code generated by language models is often spurious and contains subtle problems. We proposeBatFix, a novel approach that augments language models for transpilation by leveraging program repair and synthesis to fix the code generated by these models.BatFixtakes as input both the original program, the target program generated by the machine translation model, and a set of test cases and outputs a repaired program that passes all test cases. Experimental results show that our approach is agnostic to language models and programming languages.BatFixcan locate bugs spawning multiple lines and synthesize patches for syntax and semantic bugs for programs migrated fromJavatoC++andPythontoC++from multiple language models, including, OpenAI’sCodex.
more » « less
Full Text Available
Automated Program Repair, What Is It Good For? Not Absolutely Nothing!

https://doi.org/10.1145/3597503.3639095

Eladawy, Hadeel; Le_Goues, Claire; Brun, Yuriy (April 2024, ACM)

Industrial deployments of automated program repair (APR), e.g., at Facebook and Bloomberg, signal a new milestone for this exciting and potentially impactful technology. In these deployments, developers use APR-generated patch suggestions as part of a human-driven debugging process. Unfortunately, little is known about how using patch suggestions affects developers during debugging. This paper conducts a controlled user study with 40 developers with a median of 6 years of experience. The developers engage in debugging tasks on nine naturally-occurring defects in real-world, open-source, Java projects, using Recoder, SimFix, and TBar, three state-of-the-art APR tools. For each debugging task, the developers either have access to the project's tests, or, also, to code suggestions that make all the tests pass. These suggestions are either developer-written or APR-generated, which can be correct or deceptive. Deceptive suggestions, which are a common APR occurrence, make all the available tests pass but fail to generalize to the intended specification. Through a total of 160 debugging sessions, we find that access to a code suggestion significantly increases the odds of submitting a patch. Correct APR suggestions increase the odds of debugging success by 14,000%, but deceptive suggestions decrease the odds of success by 65%. Correct suggestions also speed up debugging. Surprisingly, we observe no significant difference in how novice and experienced developers are affected by APR, suggesting that APR may find uses across the experience spectrum. Overall, developers come away with a strong positive impression of APR, suggesting promise for APR-mediated, human-driven debugging, despite existing challenges in APR-generated repair quality.
more » « less
Full Text Available
Large Language Models for Test-Free Fault Localization

https://doi.org/10.1145/3597503.3623342

Yang, Aidan_Z H; Le_Goues, Claire; Martins, Ruben; Hellendoorn, Vincent (February 2024, ACM)

Full Text Available
MELT: Mining Effective Lightweight Transformations from Pull Requests

https://doi.org/10.1109/ASE56229.2023.00117

Ramos, Daniel; Mitchell, Hailie; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Goues, Claire Le (September 2023, 38th IEEE/ACM International Conference on Automated Software Engineering (ASE))

Software developers often struggle to update APIs, leading to manual, time-consuming, and error-prone processes. We introduce Melt, a new approach that generates lightweight API migration rules directly from pull requests in popular library repositories. Our key insight is that pull requests merged into open-source libraries are a rich source of information sufficient to mine API migration rules. By leveraging code examples mined from the library source and automatically generated code examples based on the pull requests, we infer transformation rules in Comby, a language for structural code search and replace. Since inferred rules from single code examples may be too specific, we propose a generalization procedure to make the rules more applicable to client projects. Melt rules are syntax-driven, interpretable, and easily adaptable. Moreover, unlike previous work, our approach enables rule inference to seamlessly integrate into the library workflow, removing the need to wait for client code migrations. We evaluated Melt on pull requests from four popular libraries, successfully mining 461 migration rules from code examples in pull requests and 114 rules from auto-generated code examples. Our generalization procedure increases the number of matches for mined rules by 9×. We applied these rules to client projects and ran their tests, which led to an overall decrease in the number of warnings and fixing some test cases demonstrating MELT's effectiveness in real-world scenarios.
more » « less
Patching Locking Bugs Statically with Crayons

https://doi.org/10.1145/3548684

Cruz-Carlon, Juan; Varshosaz, Mahsa; Le_Goues, Claire; Wasowski, Andrzej (April 2023, ACM Transactions on Software Engineering and Methodology)

The Linux Kernel is a world-class operating system controlling most of our computing infrastructure: mobile devices, Internet routers and services, and most of the supercomputers. Linux is also an example of low-level software with no comprehensive regression test suite (for good reasons). The kernel’s tremendous societal importance imposes strict stability and correctness requirements. These properties make Linux a challenging and relevant target for static automated program repair (APR). Over the past decade, a significant progress has been made in dynamic APR. However, dynamic APR techniques do not translate naturally to systems without tests. We present a static APR technique addressing sequentiallocking API misusebugs in the Linux Kernel. We attack the key challenge of static APR, namely, the lack of detailed program specification, by combining static analysis with machine learning to complement the information presented by the static analyzer. In experiments on historical real-world bugs in the kernel, we were able to automatically re-produce or propose equivalent patches in 85% of the human-made patches, and automatically rank them among the top three candidates for 64% of the cases and among the top five for 74%.
more » « less
Mithra: Anomaly Detection as an Oracle for Cyberphysical Systems

https://doi.org/10.1109/TSE.2021.3120680

Afzal, Afsoon; Le Goues, Claire; Timperley, Christopher Steven (November 2022, IEEE Transactions on Software Engineering)

Full Text Available
ROSDiscover: Statically Detecting Run-Time Architecture Misconfigurations in Robotics Systems

https://doi.org/10.1109/ICSA53651.2022.00019

Timperley, Christopher S.; Durschmid, Tobias; Schmerl, Bradley; Garlan, David; Le Goues, Claire (March 2022, 2022 IEEE 19th International Conference on Software Architecture (ICSA))

Full Text Available
Quality of Automated Program Repair on Real-World Defects

https://doi.org/10.1109/TSE.2020.2998785

Motwani, Manish; Soto, Mauricio; Brun, Yuriy; Just, Rene; Le Goues, Claire (February 2022, IEEE Transactions on Software Engineering)

Full Text Available
VarFix: balancing edit expressiveness and search effectiveness in automated program repair

https://doi.org/10.1145/3468264.3468600

Wong, Chu-Pan; Santiesteban, Priscila; Kästner, Christian; Le Goues, Claire (August 2021, ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering)

Full Text Available
An Empirical Study of OSS-Fuzz Bugs

https://doi.org/10.1109/MSR52588.2021.00026

Ding, Zhen Yu; Le Goues, Claire (May 2021, 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR))

Full Text Available

« Prev Next »

Search for: All records