-
Regression test selection (RTS) speeds up regression testing by re-running only the tests that might be affected by code changes. Ideal RTS safely selects all affected tests and precisely selects only affected tests. But aiming for this ideal is often slower than re-running all tests. So, recent RTS techniques use program analysis to trade precision for speed, i.e., lower regression testing time, or even use machine learning to trade safety for speed. We seek to make recent analysis-based RTS techniques more precise, to further speed up regression testing. Independent studies suggest that these techniques have reached a “performance wall” in the speed-ups that they provide. We manually inspect code changes to discover kinds of changes for which tests affected only by such changes need not be re-run. We categorize 29 kinds of changes that we found in five projects into 13 findings, 11 of which are semantics-modifying. We enhance two RTS techniques, Ekstazi and STARTS, to reason about our findings. Using 1,150 versions of 23 projects, we evaluate the impact on safety and precision of leveraging such changes. We also evaluate whether our findings from a few projects can speed up regression testing in other projects. The results show that our enhancements are effective and that they generalize. On average, they result in selecting 41.7% and 31.8% fewer tests, and take 33.7% and 28.7% less time, than Ekstazi and STARTS, respectively, with no loss in safety.
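To make the selection step concrete, below is a minimal sketch of checksum-based, class-level RTS in the spirit of Ekstazi and STARTS. All names and the change-detection scheme are illustrative assumptions, not either tool's actual implementation; the enhancements described above correspond to filtering the change set before selection.

```python
# Minimal sketch of class-level regression test selection (RTS).
# Illustrative only: names and checksum-based change detection are
# assumptions, not Ekstazi's or STARTS's actual implementation.
import hashlib
from pathlib import Path

def checksum(path: Path) -> str:
    """Content hash used to detect classes that changed between versions."""
    return hashlib.sha256(path.read_bytes()).hexdigest()

def changed_classes(old_sums: dict, class_files: dict) -> set:
    """Classes whose current checksum differs from the previous version."""
    return {name for name, path in class_files.items()
            if old_sums.get(name) != checksum(path)}

def select_tests(test_deps: dict, changed: set) -> set:
    """Select every test whose recorded dependency set intersects the
    change set. A semantics-aware enhancement, as described above,
    would first shrink `changed` by dropping changes that cannot
    affect any test outcome."""
    return {test for test, deps in test_deps.items() if deps & changed}
```

Under this framing, greater precision comes from a smaller `changed` set: every change safely excluded from it removes all tests that depend only on the excluded classes.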
-
We recently proposed inline tests for validating individual program statements; they allow developers to provide test inputs, expected outputs, and test oracles immediately after a target statement. But existing code can have many target statements, so automatic generation of inline tests is an important next step towards increasing their adoption. We propose ExLi, the first technique for automatically generating inline tests. ExLi extracts inline tests from unit tests: it first records all variable values at a target statement while executing unit tests, then uses those values as test inputs and test oracles in an initial set of generated inline tests. Target statements that are executed many times can have redundant initial inline tests, so ExLi uses a novel coverage-then-mutants-based reduction process to remove redundant inline tests. We implement ExLi for Java and use it to generate inline tests for 718 target statements in 31 open-source programs. ExLi reduces 17,273 initially generated inline tests to 905 inline tests. The final set of generated inline tests kills up to 25.1% more mutants on target statements than developer-written and automatically generated unit tests. That is, ExLi generates inline tests that can improve the fault-detection capability of the test suites from which they are extracted.
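The coverage-then-mutants reduction can be pictured as two greedy passes. The sketch below is our reading of the abstract, not ExLi's actual algorithm; the data structures are assumptions.

```python
# Illustrative two-pass reduction in the spirit of ExLi's
# coverage-then-mutants process; a simplification based only on the
# abstract, with assumed data structures.

def reduce_inline_tests(tests, coverage, killed):
    """tests:    ordered list of candidate inline tests
    coverage: maps each test to the set of coverage targets it covers
    killed:   maps each test to the set of mutants it kills
    """
    # Pass 1: keep a test only if it covers something not yet covered.
    kept, seen_cov = [], set()
    for t in tests:
        if not coverage[t] <= seen_cov:
            kept.append(t)
            seen_cov |= coverage[t]
    # Pass 2: among the survivors, keep a test only if it kills a
    # mutant that no earlier kept test kills.
    final, seen_mut = [], set()
    for t in kept:
        if not killed[t] <= seen_mut:
            final.append(t)
            seen_mut |= killed[t]
    return final
```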
-
We present pytest-inline, the first inline testing framework for Python. We recently proposed inline tests to make it easier to test individual program statements, but there is no framework-level support for developers to write inline tests in Python. To fill this gap, we design and implement pytest-inline as a plugin for pytest, the most popular Python testing framework. Using pytest-inline, a developer can write an inline test by assigning test inputs to variables in a target statement and specifying the expected test output. pytest-inline then runs each inline test and reports a failure if the target statement's output does not match the expected output. In this paper, we describe our design of pytest-inline, the testing features that it provides, and the intended use cases. Our evaluation on inline tests that we wrote for 80 target statements from 31 open-source Python projects shows that using pytest-inline incurs negligible overhead, at 0.012x. pytest-inline is integrated into the pytest-dev organization, and a video demo is at https://www.youtube.com/watch?v=pZgiAxR_uJg.
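For illustration, here is what an inline test might look like. The import path and method names (Here, given, check_eq) follow our reading of pytest-inline's published examples and should be treated as assumptions rather than a definitive API reference.

```python
# Hedged example of an inline test; the API names below are
# assumptions based on pytest-inline's published examples.
from inline import Here

def normalize(path: str) -> str:
    trimmed = path.rstrip("/")  # target statement
    # Inline test: with `path` bound to the given input, the target
    # statement's output must equal the expected value.
    Here().given(path, "/usr/lib/").check_eq(trimmed, "/usr/lib")
    return trimmed
```

A plugin-aware pytest run would collect and execute such inline tests alongside regular unit tests, failing the inline test if the check does not hold.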
-
There has been growing interest in developing machine learning (ML) models for code summarization tasks, e.g., comment generation and method naming. Despite a substantial increase in the effectiveness of ML models, the evaluation methodologies, i.e., the ways in which people split datasets into training, validation, and test sets, have not been well studied. Specifically, no prior work on code summarization considered the timestamps of code and comments during evaluation, which may lead to evaluations that are inconsistent with the intended use cases. In this paper, we introduce the time-segmented evaluation methodology, which is novel to the code summarization research community, and compare it with the mixed-project and cross-project methodologies that have been commonly used. Each methodology can be mapped to some use cases, and we argue that the time-segmented methodology should be adopted in the evaluation of ML models for code summarization. To assess the impact of methodologies, we collect a dataset of (code, comment) pairs with timestamps to train and evaluate several recent ML models for code summarization. Our experiments show that different methodologies lead to conflicting evaluation results. We invite the community to expand the set of methodologies used in evaluations.
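To make the three methodologies concrete, the sketch below splits a list of examples, each carrying project and timestamp metadata, in the three ways compared in the paper. Field names and split ratios are illustrative assumptions, not the paper's exact setup.

```python
import random

# Each example is a dict such as:
# {"project": str, "timestamp": float, "code": str, "comment": str}.
# The field names and the 80/10/10 ratio are assumptions.

def mixed_project(examples, seed=0):
    """Shuffle everything, ignoring project and time; test examples
    may predate training examples."""
    ex = examples[:]
    random.Random(seed).shuffle(ex)
    n = len(ex)
    return ex[: int(0.8 * n)], ex[int(0.8 * n): int(0.9 * n)], ex[int(0.9 * n):]

def cross_project(examples, train_p, valid_p, test_p):
    """Split by project: models are evaluated on projects never seen
    during training."""
    train = [e for e in examples if e["project"] in train_p]
    valid = [e for e in examples if e["project"] in valid_p]
    test = [e for e in examples if e["project"] in test_p]
    return train, valid, test

def time_segmented(examples, t_train, t_valid):
    """Split by timestamp: train only on data that existed before
    t_train, so evaluation never peeks into the future."""
    train = [e for e in examples if e["timestamp"] <= t_train]
    valid = [e for e in examples if t_train < e["timestamp"] <= t_valid]
    test = [e for e in examples if e["timestamp"] > t_valid]
    return train, valid, test
```

The time-segmented split is the only one of the three that respects the deployment scenario in which a model trained today summarizes code written tomorrow.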