
This content will become publicly available on April 6, 2026

Title: Explainable Adversarial Attacks on Coarse-to-Fine Classifiers
Traditional adversarial attacks typically aim to alter the predicted labels of input images by generating perturbations that are imperceptible to the human eye. However, these approaches often lack explainability. Moreover, most existing work on adversarial attacks focuses on single-stage classifiers, while attacks on multi-stage classifiers remain largely unexplored. In this paper, we introduce instance-based adversarial attacks for multi-stage classifiers, leveraging Layer-wise Relevance Propagation (LRP), which assigns relevance scores to pixels based on their influence on classification outcomes. Our approach generates explainable adversarial perturbations by using LRP to identify and target key features critical for both coarse and fine-grained classification. Unlike conventional attacks, our method not only induces misclassification but also enhances the interpretability of the model's behavior across classification stages, as demonstrated by experimental results.
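For a linear model, LRP relevance reduces exactly to the per-pixel contribution w_i * x_i, which makes the relevance-guided idea easy to sketch without a deep network. The weights, input, budget `eps`, and pixel count `k` below are all hypothetical; this is a minimal illustration of perturbing only the most relevant pixels, not the paper's method.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear "classifier": score = w . x.  For a linear model, LRP
# relevance reduces exactly to the per-pixel contribution r_i = w_i * x_i.
d = 16
w = rng.normal(size=d)   # hypothetical model weights
x = rng.normal(size=d)   # hypothetical flattened input image

def relevance(w, x):
    """Per-pixel relevance score (exact LRP for a linear model)."""
    return w * x

def lrp_guided_attack(w, x, eps=0.5, k=4):
    """Perturb only the k most relevant pixels, pushing the score down."""
    r = relevance(w, x)
    top = np.argsort(-np.abs(r))[:k]     # indices of the most relevant pixels
    delta = np.zeros_like(x)
    delta[top] = -eps * np.sign(w[top])  # move each pixel against the score
    return x + delta

x_adv = lrp_guided_attack(w, x)
print(np.count_nonzero(x_adv != x))  # only the k targeted pixels change
print(w @ x_adv < w @ x)             # score pushed toward misclassification
```

Restricting the perturbation to the top-relevance pixels is what makes the attack explainable: the changed pixels coincide with the features the relevance map says drive the decision.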
Award ID(s):
2304489
PAR ID:
10632631
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
IEEE International Conference on Acoustics, Speech and Signal Processing
Date Published:
ISBN:
979-8-3503-6874-1
Page Range / eLocation ID:
1 to 5
Format(s):
Medium: X
Location:
Hyderabad, India
Sponsoring Org:
National Science Foundation
More Like this
  1. The Controller Area Network (CAN) is a ubiquitous bus protocol present in the Electrical/Electronic (E/E) systems of almost all vehicles. It is vulnerable to a range of attacks once the attacker gains access to the bus through the vehicle's attack surface. We address the problem of intrusion detection on the CAN bus and present a series of methods based on two classifiers trained with an Auxiliary Classifier Generative Adversarial Network (ACGAN) to detect and assign fine-grained labels to Known Attacks and also detect the Unknown Attack class in a dataset containing a mixture of (Normal + Known Attacks + Unknown Attack) messages. The most effective method is a cascaded two-stage classification architecture, with the multi-class Auxiliary Classifier in the first stage for classification of Normal and Known Attacks, passing Out-of-Distribution (OOD) samples to the binary Real-Fake Classifier in the second stage for detection of the Unknown Attack class. Performance evaluation demonstrates that our method achieves both high classification accuracy and low runtime overhead, making it suitable for deployment in the resource-constrained in-vehicle environment.
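The cascaded two-stage architecture can be sketched as a confidence-thresholded router: stage 1 labels in-distribution traffic, and low-confidence (OOD) samples fall through to a stage-2 real/fake check. The threshold `tau`, the logits, and the boolean stage-2 verdict below are hypothetical stand-ins for the trained ACGAN components.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    e = np.exp(z - z.max())
    return e / e.sum()

def cascade_detect(stage1_logits, stage2_says_fake, tau=0.9):
    """Stage 1 labels Normal/Known-Attack classes; samples whose top
    softmax confidence falls below tau are treated as OOD and routed to
    the stage-2 Real-Fake check, which flags the Unknown Attack class."""
    p = softmax(stage1_logits)
    if p.max() >= tau:                 # in-distribution: trust stage 1
        return int(np.argmax(p))       # 0 = Normal, 1..K = Known Attacks
    return -1 if stage2_says_fake else int(np.argmax(p))  # -1 = Unknown

# Confident stage-1 logits are classified directly.
print(cascade_detect(np.array([5.0, 0.1, 0.2]), stage2_says_fake=False))  # 0
# Ambiguous logits flagged as fake in stage 2 become the Unknown Attack class.
print(cascade_detect(np.array([0.4, 0.5, 0.45]), stage2_says_fake=True))  # -1
```

Routing only the low-confidence residue to stage 2 is what keeps the runtime overhead low: most messages terminate at the cheap first stage.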
  2. Adversarial attacks against machine learning models have threatened various real-world applications such as spam filtering and sentiment analysis. In this paper, we propose a novel framework, learning to discriminate perturbations (DISP), to identify and adjust malicious perturbations, thereby blocking adversarial attacks on text classification models. To identify adversarial attacks, a perturbation discriminator estimates how likely each token in the text is to have been perturbed and provides a set of potential perturbations. For each potential perturbation, an embedding estimator learns to restore the embedding of the original word based on the context, and a replacement token is chosen via approximate kNN search. DISP can block adversarial attacks for any NLP model without modifying the model structure or training procedure. Extensive experiments on two benchmark datasets demonstrate that DISP significantly outperforms baseline methods in blocking adversarial attacks on text classification. In addition, in-depth analysis shows the robustness of DISP across different situations.
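The kNN replacement step can be illustrated with a toy vocabulary: a suspicious token is mapped to its nearest clean word in embedding space. The vocabulary and embeddings below are random stand-ins, the planted token "g00d" mimics a character-level perturbation, and the token's own vector stands in for the embedding estimator's context-based prediction.

```python
import numpy as np

rng = np.random.default_rng(2)

# Random stand-in embeddings for a toy vocabulary; "g00d" is planted
# close to "good" to mimic a character-level adversarial perturbation.
clean_vocab = ["good", "great", "bad", "terrible", "movie"]
emb = {w: rng.normal(size=8) for w in clean_vocab}
emb["g00d"] = emb["good"] + 0.1 * rng.normal(size=8)

def restore_token(token):
    """k=1 nearest-neighbor search over clean embeddings -- the
    replacement step of DISP (here using the token's own vector in
    place of the estimator's context-predicted embedding)."""
    t = emb[token]
    return min(clean_vocab, key=lambda w: np.linalg.norm(emb[w] - t))

print(restore_token("g00d"))  # "good"
```

In the full framework an approximate kNN index replaces the exhaustive `min` scan so the search scales to a real vocabulary.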
  3. With machine learning techniques widely used to automate Android malware detection, it is important to investigate the robustness of these methods against evasion attacks. A recent work proposed a novel problem-space attack on Android malware classifiers, where adversarial examples are generated by transforming Android malware samples while satisfying practical constraints. To address its limitations, we propose a new attack called EAGLE (Evasion Attacks Guided by Local Explanations), whose key idea is to leverage local explanations to guide the search for adversarial examples. We present a generic algorithmic framework for EAGLE attacks, which can be customized with specific feature increase and decrease operations to evade Android malware classifiers trained on different types of count features. We overcome practical challenges in implementing these operations for four different types of Android malware classifiers. Using two Android malware datasets, our results show that EAGLE attacks can be highly effective at finding functional adversarial examples. We study the attack transferability of malware variants created by EAGLE attacks across classifiers built with different classification models or trained on different types of count features. Our research further demonstrates that ensemble classifiers trained on multiple types of count features are not immune to EAGLE attacks. We also discuss possible defense mechanisms against EAGLE attacks.
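Explanation-guided search over count features can be sketched greedily, under the simplifying assumption of a linear scorer whose weights serve as an exact local explanation (in place of the learned explainers an EAGLE-style attack would query). The weights, counts, and sign-based benign threshold below are all hypothetical.

```python
import numpy as np

# Toy linear malware scorer over count features; for a linear model the
# weights double as an exact local explanation of the score.
w = np.array([2.0, -1.0, 0.5, -3.0])   # hypothetical feature weights
x = np.array([3.0, 1.0, 2.0, 0.0])     # hypothetical count features

def explanation_guided_search(w, x, max_steps=50):
    """Greedily apply the single feature increase/decrease that the
    explanation ranks as most score-lowering, until the sample scores
    benign (score < 0).  Counts are kept non-negative."""
    x = x.copy()
    for _ in range(max_steps):
        if w @ x < 0:
            return x                               # classified benign
        # score drop from +1 on negative-weight features, or -1 on
        # positive-weight features (only if the count can go down)
        gains = np.where(w < 0, -w, np.where(x > 0, w, 0.0))
        i = int(np.argmax(gains))
        if gains[i] <= 0:
            break                                  # no feasible edit helps
        x[i] += 1 if w[i] < 0 else -1
    return x

x_adv = explanation_guided_search(w, x)
print(w @ x_adv < 0)  # the variant now evades the toy scorer
```

The problem-space constraint survives in miniature here as the non-negativity of counts; the real attack must additionally preserve the malware's functionality.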
  4. Recent publications have shown that neural-network-based classifiers are vulnerable to adversarial inputs that are virtually indistinguishable from normal data, constructed explicitly to force misclassification. In this paper, we present several defenses to counter these threats. First, we observe that most adversarial attacks succeed by mounting gradient ascent on the confidence returned by the model, which allows the adversary to map the classification boundary. Our defenses are based on denying access to the precise classification boundary. Our first defense adds controlled random noise to the output confidence levels, which prevents an adversary from converging in their numerical-approximation attack. Our next defense is based on the observation that by varying the order of training, we often arrive at models that offer the same classification accuracy yet differ numerically. An ensemble of such models allows us to switch randomly between these equivalent models at query time, which further blurs the classification boundary. We demonstrate our defenses via an adversarial input generator that defeats previously published defenses but cannot breach the proposed defenses due to their non-static nature.
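The noisy-confidence defense can be sketched as post-hoc noise on the reported probability vector: benign users with a clear top-class margin still see the right label, while repeated attacker queries observe a shifting response. The noise scale `sigma` is a hypothetical choice.

```python
import numpy as np

rng = np.random.default_rng(1)

def noisy_confidence(probs, sigma=0.02):
    """Defense sketch: add controlled random noise to the reported
    confidences and renormalize.  When the top-class margin is much
    larger than sigma, the argmax label is preserved for benign use,
    but gradient-ascent queries see an unstable boundary."""
    noisy = np.clip(probs + rng.normal(scale=sigma, size=probs.shape), 1e-9, None)
    return noisy / noisy.sum()

p = np.array([0.7, 0.2, 0.1])
q1, q2 = noisy_confidence(p), noisy_confidence(p)
print(np.argmax(q1) == np.argmax(p))  # True -- label preserved
print(np.allclose(q1, q2))            # False -- responses vary per query
```

The query-to-query variation is the point of the defense: a numerical estimate of the confidence gradient no longer converges, because each probe samples a differently perturbed boundary.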
  5. We investigate the role of representations and architectures for classifying 3D shapes in terms of their computational efficiency, generalization, and robustness to adversarial transformations. By varying the number of training examples and employing cross-modal transfer learning we study the role of initialization of existing deep architectures for 3D shape classification. Our analysis shows that multiview methods continue to offer the best generalization even without pretraining on large labeled image datasets, and even when trained on simplified inputs such as binary silhouettes. Furthermore, the performance of voxel-based 3D convolutional networks and point-based architectures can be improved via cross-modal transfer from image representations. Finally, we analyze the robustness of 3D shape classifiers to adversarial transformations and present a novel approach for generating adversarial perturbations of a 3D shape for multiview classifiers using a differentiable renderer. We find that point-based networks are more robust to point position perturbations while voxel-based and multiview networks are easily fooled with the addition of imperceptible noise to the input. 
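The point-position perturbations probed in the robustness analysis can be sketched as a norm-bounded jitter on each 3D point; the budget `eps` and the random cloud below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(3)

def perturb_points(points, eps=0.01):
    """Norm-bounded jitter on 3D point positions: each point moves by
    exactly eps in a random direction, keeping the change imperceptible
    at typical shape scales."""
    delta = rng.normal(size=points.shape)
    delta *= eps / np.maximum(np.linalg.norm(delta, axis=1, keepdims=True), 1e-12)
    return points + delta

cloud = rng.uniform(-1, 1, size=(1024, 3))  # hypothetical unit-cube shape
adv = perturb_points(cloud)
print(np.linalg.norm(adv - cloud, axis=1).max() <= 0.01 + 1e-9)  # True
```

Under the paper's findings, a point-based network should tolerate such jitter, whereas voxel-based and multiview classifiers can be flipped by comparably small input noise.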