Random Smoothing Might be Unable to Certify L_infinity Robustness for High-Dimensional Images

Blum, Avrim; Dick, Travis; Manoj, Naren; Zhang, Hongyang

Citation Details

We show a hardness result for random smoothing to achieve certified adversarial robustness against attacks in the ℓp ball of radius ϵ when p>2. Although random smoothing has been well understood for the ℓ2 case using the Gaussian distribution, much remains unknown concerning the existence of a noise distribution that works for the case of p>2. This has been posed as an open problem by Cohen et al. (2019) and includes many significant paradigms such as the ℓ∞ threat model. In this work, we show that any noise distribution D over R^d that provides ℓp robustness for all base classifiers with p>2 must satisfy E[η_i^2]= Ω(d^(1−2/p) ϵ^2 (1−δ)/δ^2) for 99% of the features (pixels) of vector η∼D, where ϵ is the robust radius and δ is the score gap between the highest-scored class and the runner-up. Therefore, for high-dimensional images with pixel values bounded in [0,255], the required noise will eventually dominate the useful information in the images, leading to trivial smoothed classifiers. more »

Award ID(s):: 1815011

NSF-PAR ID:: 10202005

Author(s) / Creator(s):: Blum, Avrim; Dick, Travis; Manoj, Naren; Zhang, Hongyang

Date Published:: 2020-01-01

Journal Name:: Journal of machine learning research

Volume:: 21

Issue:: 211

ISSN:: 1532-4435

Page Range / eLocation ID:: 1-21

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this