Title: Empirically Measuring Concentration: Fundamental Limits on Intrinsic Robustness
Abstract:
Many recent works have shown that adversarial examples that fool classifiers can be found by minimally perturbing a normal input. Recent theoretical results, starting with Gilmer et al. (2018), show that if the inputs are drawn from a concentrated metric probability space, then adversarial examples with small perturbation are inevitable. A concentrated space has the property that any subset with Ω(1) measure (e.g., 1/100) under the imposed distribution is within small distance of almost all (e.g., 99/100) of the points in the space. It is not clear, however, whether these theoretical results apply to actual distributions such as images. This paper presents a method, proven to converge to the actual concentration, for empirically measuring and bounding the concentration of a concrete dataset. We use it to empirically estimate the intrinsic robustness to ℓ∞ and ℓ2 perturbations of several image classification benchmarks.
Award ID(s):
1804603
PAR ID:
10110799
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
ICLR Workshop on Debugging Machine Learning Models
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
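
To make the concentration property described in the abstract concrete, the sketch below estimates the empirical measure of the ε-expansion of a fixed subset of a finite sample under the ℓ∞ metric. This is only an illustrative sketch of the quantity involved, not the paper's method; the function name, the random data, and the chosen ε are hypothetical.

```python
import numpy as np

def empirical_expansion(data, subset_mask, eps):
    """Fraction of sample points within l_inf distance eps of the subset A.

    This is the empirical measure of the eps-expansion of A. A distribution is
    "concentrated" when subsets of non-negligible measure already have
    expansions that cover almost all points for small eps.
    """
    A = data[subset_mask]
    covered = 0
    for x in data:
        nearest = np.min(np.max(np.abs(A - x), axis=1))  # l_inf distance to A
        if nearest <= eps:
            covered += 1
    return covered / len(data)

# Hypothetical example: a subset holding 1% of 1000 random points in [0, 1]^784.
rng = np.random.default_rng(0)
data = rng.random((1000, 784))
mask = np.zeros(1000, dtype=bool)
mask[rng.choice(1000, size=10, replace=False)] = True
print(empirical_expansion(data, mask, eps=0.3))
```

Measuring concentration itself requires minimizing this expansion over a family of subsets of a given measure; the snippet above only evaluates the expansion of one fixed subset.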
More Like this
  1. Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S (Ed.)
    The susceptibility of modern machine learning classifiers to adversarial examples has motivated theoretical results suggesting that these might be unavoidable. However, these results can be too general to be applicable to natural data distributions. Indeed, humans are quite robust for tasks involving vision. This apparent conflict motivates a deeper dive into the question: Are adversarial examples truly unavoidable? In this work, we theoretically demonstrate that a key property of the data distribution – concentration on small-volume subsets of the input space – determines whether a robust classifier exists. We further demonstrate that, for a data distribution concentrated on a union of low-dimensional linear subspaces, utilizing structure in data naturally leads to classifiers that enjoy data-dependent polyhedral robustness guarantees, improving upon methods for provable certification in certain regimes. 
  2. Machine learning models are vulnerable to both security attacks (e.g., adversarial examples) and privacy attacks (e.g., private attribute inference). We take the first step to mitigate both the security and privacy attacks, and maintain task utility as well. Particularly, we propose an information-theoretic framework to achieve the goals through the lens of representation learning, i.e., learning representations that are robust to both adversarial examples and attribute inference adversaries. We also derive novel theoretical results under our framework, e.g., the inherent trade-off between adversarial robustness/utility and attribute privacy, and guaranteed attribute privacy leakage against attribute inference adversaries. 
  3. In a black-box setting, the adversary only has API access to the target model and each query is expensive. Prior work on black-box adversarial examples follows one of two main strategies: (1) transfer attacks use white-box attacks on local models to find candidate adversarial examples that transfer to the target model, and (2) optimization-based attacks use queries to the target model and apply optimization techniques to search for adversarial examples. We propose hybrid attacks that combine both strategies, using candidate adversarial examples from local models as starting points for optimization-based attacks and using labels learned in optimization-based attacks to tune local models for finding transfer candidates. We empirically demonstrate on the MNIST, CIFAR10, and ImageNet datasets that our hybrid attack strategy reduces cost and improves success rates, and in combination with our seed prioritization strategy, enables batch attacks that can efficiently find adversarial examples with only a handful of queries. (A minimal, hypothetical sketch of this hybrid loop appears after this list.)
  4. Recent studies show that state-of-the-art deep neural networks (DNNs) are vulnerable to adversarial examples, resulting from small-magnitude perturbations added to the input. Given that emerging physical systems are using DNNs in safety-critical situations, adversarial examples could mislead these systems and cause dangerous situations. Therefore, understanding adversarial examples in the physical world is an important step towards developing resilient learning algorithms. We propose a general attack algorithm, Robust Physical Perturbations (RP2), to generate robust visual adversarial perturbations under different physical conditions. Using the real-world case of road sign classification, we show that adversarial examples generated using RP2 achieve high targeted misclassification rates against standard-architecture road sign classifiers in the physical world under various environmental conditions, including viewpoints. Due to the current lack of a standardized testing method, we propose a two-stage evaluation methodology for robust physical adversarial examples consisting of lab and field tests. Using this methodology, we evaluate the efficacy of physical adversarial manipulations on real objects. With a perturbation in the form of only black and white stickers, we attack a real stop sign, causing targeted misclassification in 100% of the images obtained in lab settings, and in 84.8% of the captured video frames obtained on a moving vehicle (field test) for the target classifier.
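
The hybrid black-box attack strategy summarized in item 3 can be illustrated with a small, self-contained toy: a white-box step on a local surrogate model produces a transfer candidate, which then seeds a query-based search against the black-box target. Everything below (the linear models, step sizes, and function names) is hypothetical scaffolding, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins: linear classifiers on 2-D inputs. The local model is fully
# known to the attacker; the target can only be queried for labels.
w_target = np.array([1.0, -1.0])
w_local = np.array([0.9, -1.1])

def target_query(x):
    """Black-box target: returns a label; each call counts as one query."""
    return int(x @ w_target > 0)

def transfer_candidate(x, y, eps=0.5):
    """Stage 1: FGSM-style white-box step on the local surrogate."""
    direction = w_local if y == 0 else -w_local  # push across the local boundary
    return x + eps * np.sign(direction)

def query_search(x, y, steps=100, step_size=0.2):
    """Stage 2: random-direction search against the target, seeded at x.
    Returns the adversarial point plus the (point, label) pairs observed,
    which a full hybrid attack would reuse to tune the local models."""
    observed = []
    for _ in range(steps):
        candidate = x + step_size * np.sign(rng.standard_normal(x.shape))
        label = target_query(candidate)
        observed.append((candidate, label))
        if label != y:
            return candidate, observed
        x = candidate
    return x, observed

# Hybrid loop for one seed input that the target labels 0.
x0, y0 = np.array([-0.5, 0.5]), 0
seed = transfer_candidate(x0, y0)
if target_query(seed) != y0:      # transfer succeeded after a single query
    adv = seed
else:                             # otherwise fall back to query-based search
    adv, observed = query_search(seed, y0)
print("adversarial point:", adv, "target label:", target_query(adv))
```

In the full strategy described in the abstract, the labels collected during the query-based search would also be fed back to tune the local models so that later transfer candidates improve; the toy above omits that retraining step.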