Understanding and Mitigating the Tradeoff between Robustness and Accuracy

Aditi Raghunathan, Aditi; Michael, Sang; Xie, Michael Xie; Yang, Fanny; Duchi, John; Liang, Percy

Citation Details

Adversarial training augments the training set with perturbations to improve the robust error (over worst-case perturbations), but it often leads to an increase in the standard error (on unperturbed test inputs). Previous explanations for this tradeoff rely on the assumption that no predictor in the hypothesis class has low standard and robust error. In this work, we precisely characterize the effect of augmentation on the standard error in linear regression when the optimal linear predictor has zero standard and robust error. In particular, we show that the standard error could increase even when the augmented perturbations have noiseless observations from the optimal linear predictor. We then prove that the recently proposed robust self-training (RST) estimator improves robust error without sacrificing standard error for noiseless linear regression. Empirically, for neural networks, we find that RST with different adversarial training methods improves both standard and robust error for random and adversarial rotations and adversarial l_infty perturbations in CIFAR-10. more »

Award ID(s):: 2343611

PAR ID:: 10472131

Author(s) / Creator(s):: Aditi Raghunathan, Aditi; Michael, Sang; Xie, Michael Xie; Yang, Fanny; Duchi, John; Liang, Percy

Publisher / Repository:: Proceedings of the 37th International Conference on Machine Learning, PMLR

Date Published:: 2022-06-28

Format(s):: Medium: X

Location:: Proceedings of the 37th International Conference on Machine Learning, PMLR

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this