

This content will become publicly available on November 23, 2026

Title: One Pixel Can Change the Diagnosis: Adversarial and Non-Adversarial Robustness and Uncertainty in Breast Ultrasound Classification Model
Deep learning models have strong potential for automating breast ultrasound (BUS) image classification to support early cancer detection. However, their vulnerability to small input perturbations poses a challenge for clinical reliability. This study examines how minimal pixel-level changes affect classification performance and predictive uncertainty, using the BUSI dataset and a ResNet-50 classifier. Two perturbation types are evaluated: (1) adversarial perturbations via the One Pixel Attack and (2) non-adversarial, device-related noise simulated by setting a single pixel to black. Robustness is assessed alongside uncertainty estimation using Monte Carlo Dropout, with metrics including Expected Kullback–Leibler divergence (EKL), Predictive Variance (PV), and Mutual Information (MI) for epistemic uncertainty, and Maximum Class Probability (MP) for aleatoric uncertainty. Both perturbations reduced accuracy, producing 17 and 29 “fooled” test samples, defined as cases classified correctly before but incorrectly after perturbation, for the adversarial and non-adversarial settings, respectively. Samples that remained correct are referred to as “unfooled.” Across all metrics, uncertainty increased after perturbation for both groups, and fooled samples had higher uncertainty than unfooled samples even before perturbation. We also identify spatially localized “uncertainty-decreasing” regions, where individual single-pixel blackouts both flipped predictions and reduced uncertainty, creating overconfident errors. These regions represent high-risk vulnerabilities that could be exploited in adversarial attacks or addressed through targeted robustness training and uncertainty-aware safeguards. Overall, combining perturbation analysis with uncertainty quantification provides valuable insights into model weaknesses and can inform the design of safer, more reliable AI systems for BUS diagnosis.
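The record does not include code; as a rough illustration of the kind of pipeline the abstract describes, the sketch below applies a single-pixel blackout to an image and computes Monte Carlo Dropout uncertainty metrics (MP, PV, EKL, MI) for a ResNet-50-based classifier. The dropout placement, class count, image size, and number of stochastic forward passes are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (not the authors' code): a single-pixel blackout perturbation and
# Monte Carlo Dropout uncertainty metrics for a ResNet-50-based BUS classifier.
import torch
import torch.nn.functional as F
from torchvision.models import resnet50

def blackout_pixel(image, row, col):
    """Simulate device-related noise by setting one pixel to black in all channels."""
    perturbed = image.clone()
    perturbed[:, row, col] = 0.0
    return perturbed

def mc_dropout_predict(model, image, n_samples=30):
    """Repeated stochastic forward passes with dropout kept active at test time."""
    model.eval()
    for m in model.modules():                # keep BatchNorm statistics frozen,
        if isinstance(m, torch.nn.Dropout):  # but re-enable dropout layers
            m.train()
    with torch.no_grad():
        probs = torch.stack([
            F.softmax(model(image.unsqueeze(0)), dim=1).squeeze(0)
            for _ in range(n_samples)
        ])                                   # shape: (n_samples, n_classes)
    p = probs.clamp_min(1e-12)
    mean_p = probs.mean(dim=0).clamp_min(1e-12)
    metrics = {
        "MP": mean_p.max(),                                       # Maximum Class Probability
        "PV": probs.var(dim=0).mean(),                            # Predictive Variance
        "EKL": (p * (p.log() - mean_p.log())).sum(dim=1).mean(),  # Expected KL divergence
        "MI": -(mean_p * mean_p.log()).sum()                      # Mutual Information =
              + (p * p.log()).sum(dim=1).mean(),                  # predictive - expected entropy
    }
    return mean_p, metrics

model = resnet50()                             # stock ResNet-50 has no dropout, so add
model.fc = torch.nn.Sequential(                # a dropout layer before the classifier head
    torch.nn.Dropout(p=0.5),
    torch.nn.Linear(model.fc.in_features, 3))  # benign / malignant / normal, as in BUSI
image = torch.rand(3, 224, 224)                # stand-in for a preprocessed BUS image
clean_pred, clean_unc = mc_dropout_predict(model, image)
pert_pred, pert_unc = mc_dropout_predict(model, blackout_pixel(image, 112, 112))
```

Comparing the metrics before and after the blackout mirrors the fooled/unfooled analysis described in the abstract.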
Award ID(s):
2430746 2430747
PAR ID:
10658275
Author(s) / Creator(s):
Publisher / Repository:
Association for the Advancement of Artificial Intelligence
Date Published:
Journal Name:
Proceedings of the AAAI Symposium Series
Volume:
7
Issue:
1
ISSN:
2994-4317
Page Range / eLocation ID:
524 to 529
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Defenses against adversarial examples, such as adversarial training, are typically tailored to a single perturbation type (e.g., small Linf-noise). For other perturbations, these defenses offer no guarantees and, at times, even increase the model’s vulnerability. Our aim is to understand the reasons underlying this robustness trade-off, and to train models that are simultaneously robust to multiple perturbation types. We prove that a trade-off in robustness to different types of Lp-bounded and spatial perturbations must exist in a natural and simple statistical setting. We corroborate our formal analysis by demonstrating similar robustness trade-offs on MNIST and CIFAR10. Building upon new multi-perturbation adversarial training schemes, and a novel efficient attack for finding L1-bounded adversarial examples, we show that no model trained against multiple attacks achieves robustness competitive with that of models trained on each attack individually. In particular, we uncover a pernicious gradient-masking phenomenon on MNIST, which causes adversarial training with first-order Linf, L1 and L2 adversaries to achieve merely 50% accuracy. Our results question the viability and computational scalability of extending adversarial robustness, and adversarial training, to multiple perturbation types. 
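As a hedged illustration of the multi-perturbation adversarial training discussed in the item above, the sketch below trains on whichever attack currently yields the highest loss for the batch (a "max" strategy). The one-step L∞ and L2 attacks are simple stand-ins for the stronger adversaries the paper uses, and the per-batch (rather than per-example) worst-case selection is a simplification.

```python
# Conceptual sketch of "max"-style multi-perturbation adversarial training.
# The attacks below are one-step stand-ins; an L1 or spatial adversary would be
# plugged into the same `attacks` list.
import torch
import torch.nn.functional as F

def fgsm_linf(model, x, y, eps=0.3):
    """One-step L-inf attack (FGSM)."""
    x = x.clone().requires_grad_(True)
    grad, = torch.autograd.grad(F.cross_entropy(model(x), y), x)
    return (x + eps * grad.sign()).clamp(0, 1).detach()

def fgm_l2(model, x, y, eps=1.0):
    """One-step L2 attack: step along the normalized gradient."""
    x = x.clone().requires_grad_(True)
    grad, = torch.autograd.grad(F.cross_entropy(model(x), y), x)
    g = grad / (grad.flatten(1).norm(dim=1).view(-1, 1, 1, 1) + 1e-12)
    return (x + eps * g).clamp(0, 1).detach()

def train_step_max(model, opt, x, y, attacks):
    """Train on the adversarial example from whichever attack hurts the model most."""
    worst_loss, worst_x = None, x
    for attack in attacks:
        x_adv = attack(model, x, y)
        with torch.no_grad():
            loss = F.cross_entropy(model(x_adv), y)
        if worst_loss is None or loss > worst_loss:
            worst_loss, worst_x = loss, x_adv
    opt.zero_grad()
    F.cross_entropy(model(worst_x), y).backward()
    opt.step()

# Usage: train_step_max(model, opt, x, y, attacks=[fgsm_linf, fgm_l2])
```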
  2. Adversarial attacks against machine learning models have threatened various real-world applications such as spam filtering and sentiment analysis. In this paper, we propose a novel framework, learning to discriminate perturbations (DISP), to identify and adjust malicious perturbations, thereby blocking adversarial attacks for text classification models. To identify adversarial attacks, a perturbation discriminator estimates how likely each token in the text is to have been perturbed and provides a set of potential perturbations. For each potential perturbation, an embedding estimator learns to restore the embedding of the original word based on the context, and a replacement token is chosen via approximate kNN search. DISP can block adversarial attacks for any NLP model without modifying the model structure or training procedure. Extensive experiments on two benchmark datasets demonstrate that DISP significantly outperforms baseline methods in blocking adversarial attacks for text classification. In addition, in-depth analysis shows the robustness of DISP across different situations.
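As a very loose sketch of the recovery step described in the item above (not the released DISP code), the snippet below replaces each token flagged by a discriminator with the vocabulary word whose embedding is nearest to a context-based estimate. `discriminator`, `estimator`, `vocab`, and `vocab_embs` are assumed, pre-trained components and data structures; none of these names come from the paper.

```python
# Hypothetical sketch of DISP-style token recovery; the discriminator and the
# embedding estimator are assumed to be trained models supplied by the caller.
import numpy as np

def restore_tokens(tokens, discriminator, estimator, vocab, vocab_embs, threshold=0.5):
    """Replace likely-perturbed tokens with approximate nearest-neighbour words."""
    restored = list(tokens)
    for i in range(len(tokens)):
        if discriminator(tokens, i) > threshold:        # token i looks perturbed
            target = estimator(tokens, i)               # predicted original embedding
            dists = np.linalg.norm(vocab_embs - target, axis=1)
            restored[i] = vocab[int(np.argmin(dists))]  # nearest neighbour (k = 1)
    return restored
```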
  3. Defenses against adversarial examples, such as adversarial training, are typically tailored to a single perturbation type (e.g., small ℓ∞-noise). For other perturbations, these defenses offer no guarantees and, at times, even increase the model’s vulnerability. Our aim is to understand the reasons underlying this robustness trade-off, and to train models that are simultaneously robust to multiple perturbation types. We prove that a trade-off in robustness to different types of ℓp-bounded and spatial perturbations must exist in a natural and simple statistical setting. We corroborate our formal analysis by demonstrating similar robustness trade-offs on MNIST and CIFAR10. We propose new multi-perturbation adversarial training schemes, as well as an efficient attack for the ℓ1-norm, and use these to show that models trained against multiple attacks fail to achieve robustness competitive with that of models trained on each attack individually. In particular, we find that adversarial training with first-order ℓ∞, ℓ1 and ℓ2 attacks on MNIST achieves merely 50% robust accuracy, partly because of gradient-masking. Finally, we propose affine attacks that linearly interpolate between perturbation types and further degrade the accuracy of adversarially trained models. 
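The affine attacks mentioned at the end of the item above can be pictured as convex combinations of perturbation types. The toy snippet below blends an additive L∞ perturbation with a rotated copy of the input; it is a conceptual illustration only, and the mixing scheme, transform choice, and parameter ranges are assumptions rather than the paper's construction.

```python
# Toy illustration of interpolating between two perturbation types.
import torch
import torchvision.transforms.functional as TF

def affine_combine(x, delta_linf, angle_deg, alpha):
    """Blend an additive L-inf perturbation with a spatially rotated view of x."""
    x_noise = (x + delta_linf).clamp(0, 1)        # L-inf component
    x_rot = TF.rotate(x, angle_deg)               # spatial component
    return alpha * x_noise + (1 - alpha) * x_rot  # alpha in [0, 1], chosen adversarially
```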
  4. Flow-based generative models leverage invertible generator functions to fit a distribution to the training data using maximum likelihood. Despite their use in several application domains, the robustness of these models to adversarial attacks has hardly been explored. In this paper, we study the adversarial robustness of flow-based generative models both theoretically (for some simple models) and empirically (for more complex ones). First, we consider a linear flow-based generative model and compute optimal sample-specific and universal adversarial perturbations that maximally decrease the likelihood scores. Using this result, we study the robustness of the well-known adversarial training procedure, where we characterize the fundamental trade-off between model robustness and accuracy. Next, we empirically study the robustness of two prominent deep, non-linear, flow-based generative models, namely GLOW and RealNVP. We design two types of adversarial attacks: one minimizes the likelihood scores of in-distribution samples, while the other maximizes the likelihood scores of out-of-distribution ones. We find that GLOW and RealNVP are extremely sensitive to both types of attacks. Finally, using a hybrid adversarial training procedure, we significantly boost the robustness of these generative models.
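A rough sketch of the attack idea in the item above: perturb an input with projected gradient steps that decrease (or, for out-of-distribution inputs, increase) a flow model's log-likelihood. `flow_log_prob` is a placeholder for the log-likelihood of a trained GLOW- or RealNVP-style model, and the step sizes and budget are arbitrary choices, not the paper's settings.

```python
# Hedged sketch of a likelihood-score attack on a flow-based generative model.
import torch

def likelihood_attack(flow_log_prob, x, eps=0.05, step=0.01, iters=40, maximize=False):
    """L-inf projected gradient steps on the (signed) log-likelihood."""
    sign = 1.0 if maximize else -1.0  # maximize=True pushes OOD inputs toward high likelihood
    x_adv = x.clone()
    for _ in range(iters):
        x_adv.requires_grad_(True)
        grad, = torch.autograd.grad(sign * flow_log_prob(x_adv).sum(), x_adv)
        with torch.no_grad():
            x_adv = x_adv + step * grad.sign()
            x_adv = x + (x_adv - x).clamp(-eps, eps)  # project back into the L-inf ball
            x_adv = x_adv.clamp(0, 1)
    return x_adv.detach()
```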
  5. Universal Adversarial Perturbations (UAPs) are imperceptible, image-agnostic vectors that cause deep neural networks (DNNs) to misclassify inputs with high probability. In practical attack scenarios, adversarial perturbations may undergo transformations such as changes in pixel intensity, scaling, etc. before being added to DNN inputs. Existing methods do not create UAPs robust to these real-world transformations, thereby limiting their applicability in practical attack scenarios. In this work, we introduce and formulate UAPs robust against real-world transformations. We build an iterative algorithm using probabilistic robustness bounds and construct UAPs robust to transformations generated by composing arbitrary sub-differentiable transformation functions. We perform an extensive evaluation on the popular CIFAR-10 and ILSVRC 2012 datasets, measuring our UAPs' robustness under a wide range of common, real-world transformations such as rotation, contrast changes, etc. We further show that by using a set of primitive transformations our method can generalize well to unseen transformations such as fog, JPEG compression, etc. Our results show that our method can generate UAPs up to 23% more robust than state-of-the-art baselines.
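As a final hedged sketch of the idea in the item above, the loop below optimizes a single image-agnostic perturbation through randomly sampled, differentiable transforms (an expectation-over-transformations-style update). The transform set, perturbation budget, and image size are assumptions, not values from the paper; the transforms are expected to be differentiable torch operations.

```python
# Rough sketch of a transformation-robust universal adversarial perturbation (UAP).
import random
import torch
import torch.nn.functional as F

def robust_uap(model, loader, transforms, eps=10/255, step=1/255, epochs=5):
    """Ascend the classification loss through randomly sampled differentiable transforms."""
    delta = torch.zeros(3, 32, 32)                   # CIFAR-10-sized, image-agnostic
    for _ in range(epochs):
        for x, y in loader:
            delta.requires_grad_(True)
            t = random.choice(transforms)            # e.g. rotation, contrast change
            x_adv = t((x + delta).clamp(0, 1))       # transform applied after adding the UAP
            loss = F.cross_entropy(model(x_adv), y)  # higher loss => more misclassification
            grad, = torch.autograd.grad(loss, delta)
            with torch.no_grad():
                delta = (delta + step * grad.sign()).clamp(-eps, eps)
    return delta.detach()
```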