PCA as a defense against some adversaries

Gupte, Aparna; Banburski, Andrzej; Poggio, Tomaso


Neural network classifiers are known to be highly vulnerable to adversarial perturbations of their inputs. Under the hypothesis that adversarial examples lie outside the sub-manifold of natural images, previous work has investigated the impact of principal components in data on adversarial robustness. In this paper we show that there exists a very simple defense mechanism in the case where adversarial images are separable in a previously defined $(k,p)$ metric. This defense is very successful against the popular Carlini-Wagner attack, but less so against some other common attacks such as FGSM. Notably, the defense remains successful for relatively large perturbations.

Award ID(s): 2134108

PAR ID: 10565470

Author(s) / Creator(s): Gupte, Aparna; Banburski, Andrzej; Poggio, Tomaso

Publisher / Repository: Center for Brains, Minds and Machines (CBMM)

Date Published: 2022-03-30

Format(s): Medium: X

Institution: Massachusetts Institute of Technology

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Posted Content:
The DOI is not currently available.
