NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An empirical evaluation of pre-trained large language models for repairing declarative formal specifications

https://doi.org/10.1007/s10664-025-10687-1

Alhanahnah, Mohannad; Rashedul_Hasan, Md; Xu, Lisong; Bagheri, Hamid (September 2025, Empirical Software Engineering)

Abstract Automatic Program Repair (APR) has garnered significant attention as a practical research domain focused on automatically fixing bugs in programs. While existing APR techniques primarily target imperative programming languages like C and Java, there is a growing need for effective solutions applicable to declarative software specification languages. This paper systematically investigates the capacity of Large Language Models (LLMs) to repair declarative specifications in Alloy, a declarative formal language used for software specification. We designed six different repair settings, encompassing single-agent and dual-agent paradigms, utilizing various LLMs. These configurations also incorporate different levels of feedback, including an auto-prompting mechanism for generating prompts autonomously using LLMs. Our study reveals that dual-agent with auto-prompting setup outperforms the other settings, albeit with a marginal increase in the number of iterations and token usage. This dual-agent setup demonstrated superior effectiveness compared to state-of-the-art Alloy APR techniques when evaluated on a comprehensive set of benchmarks. This work is the first to empirically evaluate LLM capabilities to repair declarative specifications, while taking into account recent trending LLM concepts such as LLM-based agents, feedback, auto-prompting, and tools, thus paving the way for future agent-based techniques in software engineering.
more » « less
Free, publicly-accessible full text available September 1, 2026
Evolutionary Analysis of Alloy Specifications with an Adaptive Fitness Function

Wang, J; Stevens, C; Kidmose, B; Cohen, MB; Bagheri, H (July 2024, Search-Based Software Engineering. SSBSE 2024. Lecture Notes in Computer Science, vol 14767. Springer)
Jahangirova, G; Khomh, F (Ed.)
Full Text Available
Scalable Relational Analysis via Relational Bound Propagation

https://doi.org/10.1145/3597503.3639171

Stevens, Clay; Bagheri, Hamid (April 2024, ICSE '24: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering)

Full Text Available
An Empirical Study Assessing Software Modeling in Alloy

https://doi.org/10.1109/FormaliSE58978.2023.00013

Mansoor, Niloofar; Bagheri, Hamid; Kang, Eunsuk; Sharif, Bonita (May 2023, 2023 IEEE/ACM 11th International Conference on Formal Methods in Software Engineering (FormaliSE))

Full Text Available
Parasol: efficient parallel synthesis of large model spaces

https://doi.org/10.1145/3540250.3549157

Stevens, Clay; Bagheri, Hamid (November 2022, ESEC/FSE 2022: Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering)

Full Text Available
ICEBAR: Feedback-Driven Iterative Repair of Alloy Specifications

https://doi.org/10.1145/3551349.3556944

Gutiérrez Brida, Simón; Regis, Germán; Zheng, Guolong; Bagheri, Hamid; Nguyen, Thanhvu; Aguirre, Nazareno; Frias, Marcelo (October 2022, ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering)

Full Text Available
Combining solution reuse and bound tightening for efficient analysis of evolving systems

https://doi.org/10.1145/3533767.3534399

Stevens, Clay; Bagheri, Hamid (July 2022, ISSTA 2022: Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis)

Full Text Available
SAINTDroid: Scalable, Automated Incompatibility Detection for Android

https://doi.org/10.1109/DSN53405.2022.00062

Silva, Bruno; Stevens, Clay; Mansoor, Niloofar; Srisa-An, Witawas; Yu, Tingting; Bagheri, Hamid (June 2022, 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN))

Full Text Available

Search for: All records