Efficient Adversarial Training With Transferable Adversarial Examples

Zheng, Haizhong; Zhang, Ziqi; Gu, Juncheng; Lee, Honglak; Prakash, Atul

doi:10.1109/CVPR42600.2020.00126

Citation Details

Efficient Adversarial Training With Transferable Adversarial Examples

Adversarial training is an effective defense method to protect classification models against adversarial attacks. However, one limitation of this approach is that it can re- quire orders of magnitude additional training time due to high cost of generating strong adversarial examples dur- ing training. In this paper, we first show that there is high transferability between models from neighboring epochs in the same training process, i.e., adversarial examples from one epoch continue to be adversarial in subsequent epochs. Leveraging this property, we propose a novel method, Adversarial Training with Transferable Adversarial Examples (ATTA), that can enhance the robustness of trained models and greatly improve the training efficiency by accumulating adversarial perturbations through epochs. Compared to state-of-the-art adversarial training methods, ATTA enhances adversarial accuracy by up to 7.2% on CIFAR10 and requires 12 ∼ 14× less training time on MNIST and CIFAR10 datasets with comparable model robustness. more »

Award ID(s):: 1646392

PAR ID:: 10205623

Author(s) / Creator(s):: Zheng, Haizhong; Zhang, Ziqi; Gu, Juncheng; Lee, Honglak; Prakash, Atul

Date Published:: 2020-06-01

Journal Name:: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Page Range / eLocation ID:: 1178 to 1187

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/CVPR42600.2020.00126

More Like this