NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The SGSM framework: Enabling the specification and monitor synthesis of safe driving properties through scene graphs

https://doi.org/10.1016/j.scico.2024.103252

Woodlief, Trey; Toledo, Felipe; Elbaum, Sebastian; Dwyer, Matthew B (May 2025, Science of Computer Programming)

Free, publicly-accessible full text available May 1, 2026
A Differential Testing Framework to Identify Critical AV Failures Leveraging Arbitrary Inputs

https://doi.org/10.1109/ICSE55347.2025.00163

Woodlief, Trey; Hildebrandt, Carl; Elbaum, Sebastian (May 2025, IEEE Computer Society - IEEE/ACM 47th International Conference on Software Engineering (ICSE))

The proliferation of autonomous vehicles (AVs) has made their failures increasingly evident. Testing efforts aimed at identifying the inputs leading to those failures are challenged by the input’s long-tail distribution, whose area under the curve is dominated by rare scenarios. We hypothesize that leveraging emerging open-access datasets can accelerate the exploration of long-tail inputs. Having access to diverse inputs, however, is not sufficient to expose failures; an effective test also requires an oracle to distinguish between correct and incorrect behaviors. Current datasets lack such oracles and developing them is notoriously difficult. In response, we propose DiffTest4AV, a differential testing framework designed to address the unique challenges of testing AV systems: 1) for any given input, many outputs may be considered acceptable, 2) the long tail contains an insurmountable number of inputs to explore, and 3) the AV’s continuous execution loop requires failures to persist in order to affect the system. DiffTest4AV integrates statistical analysis to identify meaningful behavioral variations, judges their importance in terms of the severity of these differences, and incorporates sequential analysis to detect persistent errors indicative of potential system-level failures. Our study on 5 versions of the commercially-available, road-deployed comma.ai OpenPilot system, using 3 available image datasets, demonstrates the capabilities of the framework to detect high-severity, high-confidence, long-running test failures.
more » « less
Free, publicly-accessible full text available May 1, 2026
ODD-diLLMma: Driving Automation System ODD Compliance Checking using LLMs

https://doi.org/10.1109/IROS58592.2024.10801369

Hildebrandt, Carl; Woodlief, Trey; Elbaum, Sebastian (October 2024, IEEE)

Full Text Available
Automated Generation of Transformations to Mitigate Sensor Hardware Migration in ADS

https://doi.org/10.1109/LRA.2024.3405810

von_Stein, Meriel; Wang, Hongning; Elbaum, Sebastian (July 2024, IEEE Robotics and Automation Letters)

Full Text Available
Specifying and Monitoring Safe Driving Properties with Scene Graphs

https://doi.org/10.1109/ICRA57147.2024.10610973

Toledo, Felipe; Woodlief, Trey; Elbaum, Sebastian; Dwyer, Matthew B (May 2024, IEEE)

Full Text Available
S3C: Spatial Semantic Scene Coverage for Autonomous Vehicles

https://doi.org/10.1145/3597503.3639178

Woodlief, Trey; Toledo, Felipe; Elbaum, Sebastian; Dwyer, Matthew B (April 2024, ACM)
Training for Verification: Increasing Neuron Stability to Scale DNN Verification

https://doi.org/10.1007/978-3-031-57256-2_2

Xu, Dong; Mozumder, Nusrat J; Duong, Hai; Dwyer, Matthew B (April 2024, 30th International Conference Tools and Algorithms for the Construction and Analysis of Systems)
Finkbeiner, Bernd; Kovacs, Laura (Ed.)
With the growing use of deep neural networks(DNN) in mis- sion and safety-critical applications, there is an increasing interest in DNN verification. Unfortunately, increasingly complex network struc- tures, non-linear behavior, and high-dimensional input spaces combine to make DNN verification computationally challenging. Despite tremen- dous advances, DNN verifiers are still challenged to scale to large ver- ification problems. In this work, we explore how the number of stable neurons under the precondition of a specification gives rise to verifica- tion complexity. We examine prior work on the problem, adapt it, and develop several novel approaches to increase stability. We demonstrate that neuron stability can be increased substantially without compromis- ing model accuracy and this yields a multi-fold improvement in DNN verifier performance.
more » « less
Full Text Available
Deeper Notions of Correctness in Image-Based DNNs: Lifting Properties from Pixel to Entities

https://doi.org/10.1145/3611643.3613079

Toledo, Felipe; Shriver, David; Elbaum, Sebastian; Dwyer, Matthew B. (November 2023, ACM)
DeepManeuver: Adversarial Test Generation for Trajectory Manipulation of Autonomous Vehicles

https://doi.org/10.1109/TSE.2023.3301443

von Stein, Meriel; Shriver, David; Elbaum, Sebastian (October 2023, IEEE Transactions on Software Engineering)

Full Text Available

Search for: All records