Generation of Low Distortion Adversarial Attacks via Convex Programming

Zhang, T; Liu, S; Wang, Y; Fardad, M

doi:10.1109/ICDM.2019.00195

Citation Details

Generation of Low Distortion Adversarial Attacks via Convex Programming

As deep neural networks (DNNs) achieve extraordi- nary performance in a wide range of tasks, testing their robust- ness under adversarial attacks becomes paramount. Adversarial attacks, also known as adversarial examples, are used to measure the robustness of DNNs and are generated by incorporating imperceptible perturbations into the input data with the intention of altering a DNN’s classification. In prior work in this area, most of the proposed optimization based methods employ gradient descent to find adversarial examples. In this paper, we present an innovative method which generates adversarial examples via convex programming. Our experiment results demonstrate that we can generate adversarial examples with lower distortion and higher transferability than the C&W attack, which is the current state-of-the-art adversarial attack method for DNNs. We achieve 100% attack success rate on both the original undefended models and the adversarially-trained models. Our distortions of the L∞ attack are respectively 31% and 18% lower than the C&W attack for the best case and average case on the CIFAR-10 data set. more »

Award ID(s):: 1750531

PAR ID:: 10287609

Author(s) / Creator(s):: Zhang, T; Liu, S; Wang, Y; Fardad, M

Date Published:: 2019-01-01

Journal Name:: IEEE International Conference on Data Mining

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICDM.2019.00195

More Like this