Title: Stability and Generalization of Adversarial Training for Shallow Neural Networks with Smooth Activation
Adversarial training has emerged as a popular approach for training models that are robust to inference-time adversarial attacks. However, our theoretical understanding of why and when it works remains limited. Prior work has offered generalization analyses of adversarial training, but these are either restricted to the Neural Tangent Kernel (NTK) regime or rely on restrictive data assumptions such as (noisy) linear separability or robust realizability. In this work, we study the stability and generalization of adversarial training for two-layer networks without any data distribution assumptions and beyond the NTK regime. Our findings suggest that for networks with any given initialization and sufficiently large width, the generalization bound can be effectively controlled via early stopping. We further improve the generalization bound by leveraging smoothing via the Moreau envelope.
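The sketch below illustrates the kind of procedure the abstract refers to: adversarial training of a two-layer network with a smooth activation, a PGD inner maximization, and early stopping on held-out loss. The activation, attack, optimizer, and stopping rule are illustrative assumptions, not the paper's exact algorithm.

```python
# Illustrative adversarial training loop for a two-layer network with a smooth
# activation (tanh here). PGD parameters, learning rate, and the early-stopping
# criterion are placeholder choices; labels are assumed to be +/-1 floats.
import torch
import torch.nn as nn

def two_layer_net(d, width):
    # One hidden layer with a smooth activation, scalar output.
    return nn.Sequential(nn.Linear(d, width), nn.Tanh(), nn.Linear(width, 1))

def pgd_attack(model, x, y, eps=0.1, alpha=0.02, steps=10):
    # Approximate inner maximization: l_inf-bounded PGD on the logistic loss.
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = nn.functional.soft_margin_loss(model(x + delta).squeeze(-1), y)
        grad, = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            delta += alpha * grad.sign()
            delta.clamp_(-eps, eps)
    return (x + delta).detach()

def adversarial_train(model, x_tr, y_tr, x_val, y_val, epochs=200, lr=0.05):
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    best_val, best_state = float("inf"), None
    for _ in range(epochs):
        x_adv = pgd_attack(model, x_tr, y_tr)      # outer step on adversarial points
        opt.zero_grad()
        nn.functional.soft_margin_loss(model(x_adv).squeeze(-1), y_tr).backward()
        opt.step()
        with torch.no_grad():                      # early stopping: keep the best iterate
            val = nn.functional.soft_margin_loss(model(x_val).squeeze(-1), y_val).item()
        if val < best_val:
            best_val = val
            best_state = {k: v.clone() for k, v in model.state_dict().items()}
    model.load_state_dict(best_state)
    return model
```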
Award ID(s):
1943251
PAR ID:
10572984
Author(s) / Creator(s):
Publisher / Repository:
38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Date Published:
ISSN:
1049-5258
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. We study training one-hidden-layer ReLU networks in the neural tangent kernel (NTK) regime, where the networks' biases are initialized to some constant rather than zero. We prove that under such initialization, the neural network has sparse activation throughout the entire training process, which enables fast training procedures via sophisticated computational methods. With such initialization, we show that the neural networks possess a different limiting kernel, which we call the bias-generalized NTK, and we study various properties of the neural networks with this new kernel. We first characterize the gradient descent dynamics. In particular, we show that the network in this case can achieve convergence as fast as the dense network, in contrast to prior work suggesting that sparse networks converge more slowly. In addition, our result improves on the width previously required to ensure convergence. Second, we study the networks' generalization: we show a width-sparsity dependence, which yields a sparsity-dependent Rademacher complexity and generalization bound. To our knowledge, this is the first sparsity-dependent generalization result via Rademacher complexity. Finally, we study the smallest eigenvalue of this new kernel. We identify a data-dependent region where we can derive a much sharper lower bound on the NTK's smallest eigenvalue than the previously known worst-case bound, which can lead to improvements in the generalization bound.
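As an illustrative numerical check (not taken from the paper), the snippet below initializes the hidden biases of a one-hidden-layer ReLU network to a negative constant and measures what fraction of units fire on random unit-norm inputs; the width, bias value, and data are arbitrary choices.

```python
# Constant (negative) bias initialization makes ReLU activations sparse:
# with w_j ~ N(0, I) and unit-norm inputs, w_j.x ~ N(0, 1), so a bias of -1
# leaves roughly P(N(0,1) > 1) ~ 16% of units active.
import torch

d, width, n = 32, 4096, 1000
W = torch.randn(width, d)                                    # w_j ~ N(0, I_d)
b = torch.full((width,), -1.0)                               # constant bias, not zero
x = torch.nn.functional.normalize(torch.randn(n, d), dim=1)  # unit-norm inputs

pre_act = x @ W.t() + b                                      # shape (n, width)
print("active fraction:", (pre_act > 0).float().mean().item())
```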
  2. The adversarial vulnerability of neural nets, and subsequent techniques to create robust models, have attracted significant attention; yet we still lack a full understanding of this phenomenon. Here, we study adversarial examples of trained neural networks through analytical tools afforded by recent theory advances connecting neural networks and kernel methods, namely the Neural Tangent Kernel (NTK), following a growing body of work that leverages the NTK approximation to successfully analyze important deep learning phenomena and design algorithms for new applications. We show how NTKs allow one to generate adversarial examples in a "training-free" fashion, and demonstrate that they transfer to fool their finite-width neural net counterparts in the "lazy" regime. We leverage this connection to provide an alternative view on robust and non-robust features, which have been suggested to underlie the adversarial brittleness of neural nets. Specifically, we define and study features induced by the eigendecomposition of the kernel to better understand the role of robust and non-robust features, the reliance on both for standard classification, and the robustness-accuracy trade-off. We find that such features are surprisingly consistent across architectures, and that robust features tend to correspond to the largest eigenvalues of the model, and thus are learned early during training. Our framework allows us to identify and visualize non-robust yet useful features. Finally, we shed light on the robustness mechanism underlying adversarial training of neural nets used in practice: quantifying the evolution of the associated empirical NTK, we demonstrate that its dynamics falls into the "lazy" regime much earlier and manifests a much stronger form of the well-known bias to prioritize learning features within the top eigenspaces of the kernel, compared to standard training.
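A rough sketch of the "training-free" idea under simplifying assumptions: compute the empirical NTK of a small randomly initialized network, fit kernel ridge regression on the training set, and take a single FGSM-style step against that kernel predictor. The architecture, ridge parameter, and attack step are placeholders, and the paper's exact construction may differ.

```python
# Empirical-NTK kernel predictor and a one-step adversarial perturbation
# against it (illustrative; output at initialization is ignored).
import torch
import torch.nn as nn

torch.manual_seed(0)
d, width, n, lam, eps = 16, 128, 64, 1e-3, 0.1
net = nn.Sequential(nn.Linear(d, width), nn.ReLU(), nn.Linear(width, 1))
params = list(net.parameters())

def flat_grad(x, create_graph=False):
    # Gradient of the scalar network output at x with respect to all parameters.
    out = net(x.unsqueeze(0)).squeeze()
    grads = torch.autograd.grad(out, params, create_graph=create_graph)
    return torch.cat([g.reshape(-1) for g in grads])

X, y = torch.randn(n, d), torch.sign(torch.randn(n))        # toy data, +/-1 labels
J = torch.stack([flat_grad(X[i]) for i in range(n)])        # (n, p) parameter Jacobian
K = J @ J.t()                                               # empirical NTK Gram matrix
alpha = torch.linalg.solve(K + lam * torch.eye(n), y)       # kernel ridge coefficients
u = J.t() @ alpha                                           # predictor: f(x) = <grad_theta f(x), u>

x_test = torch.randn(d, requires_grad=True)
score = flat_grad(x_test, create_graph=True) @ u            # kernel predictor at x_test
grad_x, = torch.autograd.grad(score, x_test)
x_adv = x_test.detach() - eps * grad_x.sign()               # FGSM step that lowers the score
```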
  3. Benign overfitting is the phenomenon wherein none of the predictors in the hypothesis class can achieve perfect accuracy (i.e., the non-realizable or noisy setting), but a model that interpolates the training data still achieves good generalization. A series of recent works aims to understand this phenomenon for regression and classification tasks using linear predictors as well as two-layer neural networks. In this paper, we study such a benign overfitting phenomenon in an adversarial setting. We show that under a distributional assumption, interpolating neural networks found using adversarial training generalize well despite inference-time attacks. Specifically, we provide convergence and generalization guarantees for adversarial training of two-layer networks (with smooth as well as non-smooth activation functions), showing that under a moderate ℓ2-norm perturbation budget, the trained model has near-zero robust training loss and near-optimal robust generalization error. We support our theoretical findings with an empirical study on synthetic and real-world data.
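Since this guarantee is stated for an ℓ2-norm perturbation budget, a minimal sketch of the corresponding inner maximization is given below: per-example normalized gradient ascent followed by projection of the perturbation back onto the ℓ2 ball. The step size, step count, and radius are illustrative choices, not the paper's settings.

```python
# l2-bounded PGD inner maximization (illustrative hyperparameters).
import torch

def l2_pgd(model, loss_fn, x, y, eps=0.5, alpha=0.1, steps=10):
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = loss_fn(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            # Per-example normalized ascent step.
            g = grad.flatten(1)
            delta += alpha * (g / g.norm(dim=1, keepdim=True).clamp_min(1e-12)).view_as(delta)
            # Project each perturbation onto the l2 ball of radius eps.
            norms = delta.flatten(1).norm(dim=1).clamp_min(1e-12)
            scale = (eps / norms).clamp(max=1.0)
            delta *= scale.view(-1, *([1] * (delta.dim() - 1)))
    return (x + delta).detach()
```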
  4. Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies of Khim and Loh (2018) and Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfactory bound remains an elusive goal. Existing works on DNNs either apply to a surrogate loss instead of the robust loss or yield bounds that are notably looser than their standard counterparts. In the latter case, the bounds have a higher dependency on the width m of the DNNs or the dimension d of the data, with an extra factor of at least O(√m) or O(√d). This paper presents upper bounds for the adversarial Rademacher complexity of DNNs that match the best-known upper bounds in standard settings, as established in the work of Bartlett et al. (2017), with the dependency on width and dimension being O(ln(dm)). The central challenge addressed is calculating the covering number of adversarial function classes. We aim to construct a new cover that possesses two properties: 1) compatibility with adversarial examples, and 2) precision comparable to covers used in standard settings. To this end, we introduce a new variant of the covering number, called the uniform covering number, specifically designed and proven to reconcile these two properties. Consequently, our method effectively bridges the gap between Rademacher complexity in robust and standard generalization.
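For reference, one common formalization of the quantity being bounded, following Yin et al. (2019), is sketched below; the uniform covering number introduced in the paper governs how the supremum over the function class is discretized, and the exact object bounded there may differ in details.

```latex
% Adversarial (empirical) Rademacher complexity of a class \mathcal{F} on a
% sample S = {(x_i, y_i)}_{i=1}^n, with i.i.d. Rademacher signs \sigma_i:
\[
\widetilde{\mathfrak{R}}_S(\mathcal{F})
  = \mathbb{E}_{\sigma}\!\left[\sup_{f \in \mathcal{F}}
    \frac{1}{n}\sum_{i=1}^{n} \sigma_i
    \min_{\|x_i' - x_i\| \le \epsilon} y_i\, f(x_i')\right].
\]
```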
  5. Federated Learning (FL) is an emerging learning scheme that allows different distributed clients to train deep neural networks together without data sharing. Neural networks have become popular due to their unprecedented success. To the best of our knowledge, theoretical guarantees for FL with neural networks that have explicit forms and multi-step updates remain unexplored. Analyzing the training of neural networks in FL is non-trivial for two reasons: first, the objective loss function being optimized is non-smooth and non-convex, and second, the updates are not even taken in the gradient direction. Existing convergence results for gradient-descent-based methods rely heavily on the fact that the gradient direction is used for updating. This paper presents a new class of convergence analysis for FL, Federated Learning Neural Tangent Kernel (FL-NTK), which corresponds to over-parameterized ReLU neural networks trained by gradient descent in FL and is inspired by the Neural Tangent Kernel (NTK) analysis. Theoretically, FL-NTK converges to a globally optimal solution at a linear rate with properly tuned learning parameters. Furthermore, with proper distributional assumptions, FL-NTK can also achieve good generalization.
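A minimal FedAvg-style round is sketched below to make the multi-step, non-gradient-direction update concrete; this is an illustrative setup (clients, data, width, and step counts are arbitrary), not the FL-NTK analysis itself.

```python
# Minimal FedAvg round: each client runs several local gradient steps, and the
# server averages parameters, so the global update is not a single gradient step.
import copy
import torch
import torch.nn as nn

def local_update(global_model, x, y, local_steps=5, lr=0.1):
    model = copy.deepcopy(global_model)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(local_steps):
        opt.zero_grad()
        nn.functional.mse_loss(model(x).squeeze(-1), y).backward()
        opt.step()
    return model.state_dict()

def fedavg_round(global_model, client_data):
    # Broadcast, local training, then coordinate-wise parameter averaging.
    states = [local_update(global_model, x, y) for x, y in client_data]
    avg = {k: torch.stack([s[k] for s in states]).mean(dim=0) for k in states[0]}
    global_model.load_state_dict(avg)
    return global_model

# Toy usage: four clients, a wide one-hidden-layer ReLU network.
clients = [(torch.randn(50, 10), torch.randn(50)) for _ in range(4)]
model = nn.Sequential(nn.Linear(10, 512), nn.ReLU(), nn.Linear(512, 1))
for _ in range(20):
    model = fedavg_round(model, clients)
```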