NSF PAR Search | NSF Public Access Repository

On the Effectiveness of LLM-as-a-Judge for Code Generation and Summarization

https://doi.org/10.1109/TSE.2025.3586082

Crupi, Giuseppe; Tufano, Rosalia; Velasco, Alejandro; Mastropaolo, Antonio; Poshyvanyk, Denys; Bavota, Gabriele (August 2025, IEEE Transactions on Software Engineering)

Free, publicly-accessible full text available August 1, 2026
A 2030 Roadmap for Software Engineering

https://doi.org/10.1145/3731559

Pezzè, Mauro; Abrahão, Silvia; Penzenstadler, Birgit; Poshyvanyk, Denys; Roychoudhury, Abhik; Yue, Tao (June 2025, ACM Transactions on Software Engineering and Methodology)

The landscape of software engineering has dramatically changed in recent years. The impressive advances of artificial intelligence are just the latest and most disruptive innovation that has remarkably changed the software engineering research and practice. This special issue shares a roadmap to guide the software engineering community in this confused era. This roadmap is the outcome of a 2-day intensive discussion at the 2030 Software Engineering workshop. The roadmap spotlights and discusses seven main landmarks in the new software engineering landscape: artificial intelligence for software engineering, human aspects of software engineering, software security, verification and validation, sustainable software engineering, automatic programming, and quantum software engineering. This editorial summarizes the core aspects discussed in the 37 papers that comprise the seven sections of the special issue and guides the interested readers throughout the issue. This roadmap is a living body that we will refine with follow-up workshops that will update the roadmap for a series of forthcoming ACM TOSEM special issues.
Free, publicly-accessible full text available June 30, 2026
Artificial Intelligence for Software Engineering: The Journey So Far and the Road Ahead

https://doi.org/10.1145/3719006

Ahmed, Iftekhar; Aleti, Aldeida; Cai, Haipeng; Chatzigeorgiou, Alexander; He, Pinjia; Hu, Xing; Pezzè, Mauro; Poshyvanyk, Denys; Xia, Xin (June 2025, ACM Transactions on Software Engineering and Methodology)

Artificial intelligence and recent advances in deep learning architectures, including transformer networks and large language models, change the way people think and act to solve problems. Software engineering, as an increasingly complex process to design, develop, test, deploy, and maintain large-scale software systems for solving real-world challenges, is profoundly affected by many revolutionary artificial intelligence tools in general and machine learning in particular. In this roadmap for artificial intelligence in software engineering, we highlight the recent deep impact of artificial intelligence on software engineering by discussing successful stories of applications of artificial intelligence to classic and new software development challenges. We identify the new challenges that the software engineering community has to address in the coming years to successfully apply artificial intelligence in software engineering, and we share our research roadmap toward the effective use of artificial intelligence in the software engineering profession, while still protecting fundamental human values. We spotlight three main areas that challenge the research in software engineering: the use of generative artificial intelligence and large language models for engineering large software systems, the need of large and unbiased datasets and benchmarks for training and evaluating deep learning and large language models for software engineering, and the need of a new code of digital ethics to apply artificial intelligence in software engineering.
Free, publicly-accessible full text available June 30, 2026
A Path Less Traveled: Reimagining Software Engineering Automation via a Neurosymbolic Paradigm

https://doi.org/10.1145/3696630.3728720

Mastropaolo, Antonio; Poshyvanyk, Denys (June 2025, ACM)

Free, publicly-accessible full text available June 23, 2026
SnipGen: A Mining Repository Framework for Evaluating LLMs for Code

https://doi.org/10.1109/MSR66628.2025.00084

Rodriguez-Cardenas, Daniel; Velasco, Alejandro; Poshyvanyk, Denys (April 2025, IEEE)

Free, publicly-accessible full text available April 28, 2026
How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study

https://doi.org/10.1109/ICSE-NIER66352.2025.00025

Velasco, Alejandro; Rodriguez-Cardenas, Daniel; Alif, Luftar Rahman; Palacio, David N; Poshyvanyk, Denys (April 2025, IEEE)

Free, publicly-accessible full text available April 27, 2026
Toward Neurosymbolic Program Comprehension

https://doi.org/10.1109/ICPC66645.2025.00047

Velasco, Alejandro; Garryyeva, Aya; Palacio, David N; Mastropaolo, Antonio; Poshyvanyk, Denys (April 2025, IEEE)

Free, publicly-accessible full text available April 27, 2026
Towards More Trustworthy Deep Code Models by Enabling Out-of-Distribution Detection

https://doi.org/10.1109/ICSE55347.2025.00177

Yan, Yanfu; Duong, Viet; Shao, Huajie; Poshyvanyk, Denys (April 2025, IEEE)

Free, publicly-accessible full text available April 26, 2026
All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens

https://doi.org/10.18653/v1/2025.emnlp-main.1565

Mamidanna, Siddarth; Rai, Daking; Yao, Ziyu; Zhou, Yilun (January 2025, Association for Computational Linguistics)

Full Text Available
Enhancing Code Understanding for Impact Analysis by Combining Transformers and Program Dependence Graphs

https://doi.org/10.1145/3643770

Yan, Yanfu; Cooper, Nathan; Moran, Kevin; Bavota, Gabriele; Poshyvanyk, Denys; Rich, Steve (July 2024, Proceedings of the ACM on Software Engineering)

Impact analysis (IA) is a critical software maintenance task that identifies the effects of a given set of code changes on a larger software project with the intention of avoiding potential adverse effects. IA is a cognitively challenging task that involves reasoning about the abstract relationships between various code constructs. Given its difficulty, researchers have worked to automate IA with approaches that primarily use coupling metrics as a measure of the connectedness of different parts of a software project. Many of these coupling metrics rely on static, dynamic, or evolutionary information and are based on heuristics that tend to be brittle, require expensive execution analysis, or large histories of co-changes to accurately estimate impact sets. In this paper, we introduce a novel IA approach, called ATHENA, that combines a software system's dependence graph information with a conceptual coupling approach that uses advances in deep representation learning for code without the need for change histories and execution information. Previous IA benchmarks are small, containing less than ten software projects, and suffer from tangled commits, making it difficult to measure accurate results. Therefore, we constructed a large-scale IA benchmark, from 25 open-source software projects, that utilizes fine-grained commit information from bug fixes. On this new benchmark, our best performing approach configuration achieves an mRR, mAP, and HIT@10 score of 60.32%, 35.19%, and 81.48%, respectively. Through various ablations and qualitative analyses, we show that ATHENA's novel combination of program dependence graphs and conceptual coupling information leads it to outperform a simpler baseline by 10.34%, 9.55%, and 11.68% with statistical significance.
Full Text Available
