Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness

Pal, Ambar; Sulam, Jeremias; Vidal, Rene

The susceptibility of modern machine learning classifiers to adversarial examples has motivated theoretical results suggesting that these might be unavoidable. However, these results can be too general to be applicable to natural data distributions. Indeed, humans are quite robust for tasks involving vision. This apparent conflict motivates a deeper dive into the question: Are adversarial examples truly unavoidable? In this work, we theoretically demonstrate that a key property of the data distribution – concentration on small-volume subsets of the input space – determines whether a robust classifier exists. We further demonstrate that, for a data distribution concentrated on a union of low-dimensional linear subspaces, utilizing structure in data naturally leads to classifiers that enjoy data-dependent polyhedral robustness guarantees, improving upon methods for provable certification in certain regimes.

Award ID(s): 2212457

PAR ID: 10528042

Author(s) / Creator(s): Pal, Ambar; Sulam, Jeremias; Vidal, Rene

Editor(s): Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S

Publisher / Repository: NeurIPS

Date Published: 2024-05-30

ISSN: 1049-5258

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
The DOI is not currently available.