Automatically Detecting Numerical Instability in Machine Learning Applications via Soft Assertions

Sharmin, Shaila; Zahid, Anwar Hossain; Bhattacharjee, Subhankar; Igwilo, Chiamaka; Kim, Miryung; Le, Wei

doi:10.1145/3729394

Citation Details

This content will become publicly available on June 19, 2026

Automatically Detecting Numerical Instability in Machine Learning Applications via Soft Assertions

Machine learning (ML) applications have become an integral part of our lives. ML applications extensively use floating-point computation and involve very large/small numbers; thus, maintaining the numerical stability of such complex computations remains an important challenge. Numerical bugs can lead to system crashes, incorrect output, and wasted computing resources. In this paper, we introduce a novel idea, namelysoft assertions (SA), to encode safety/error conditions for the places where numerical instability can occur. A soft assertion is an ML model automatically trained using the dataset obtained during unit testing of unstable functions. Given the values at the unstable function in an ML application, a soft assertion reports how to change these values in order to trigger the instability. We then use the output of soft assertions as signals to effectively mutate inputs to trigger numerical instability in ML applications. In the evaluation, we used the GRIST benchmark, a total of 79 programs, as well as 15 real-world ML applications from GitHub. We compared our tool with 5 state-of-the-art (SOTA) fuzzers. We found all the GRIST bugs and outperformed the baselines. We found 13 numerical bugs in real-world code, one of which had already been confirmed by the GitHub developers. While the baselines mostly found the bugs that report NaN and INF, our tool found numerical bugs with incorrect output. We showed one case where theTumor Detection Model, trained on Brain MRI images, should have predicted ”tumor”, but instead, it incorrectly predicted ”no tumor” due to the numerical bugs. Our replication package is located at https://figshare.com/s/6528d21ccd28bea94c32. more »

Award ID(s):: 2313054

PAR ID:: 10629703

Author(s) / Creator(s):: Sharmin, Shaila; Zahid, Anwar Hossain; Bhattacharjee, Subhankar; Igwilo, Chiamaka; Kim, Miryung; Le, Wei

Publisher / Repository:: Proceedings of the ACM on Software Engineering

Date Published:: 2025-06-19

Journal Name:: Proceedings of the ACM on Software Engineering

Volume:: 2

Issue:: FSE

ISSN:: 2994-970X

Page Range / eLocation ID:: 2806 to 2827

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 19, 2026
Journal Article:
https://doi.org/10.1145/3729394

More Like this