Title: Norm-based Generalization Bounds for Sparse Neural Networks
In this paper, we derive norm-based generalization bounds for sparse ReLU neural networks, including convolutional neural networks. These bounds differ from previous ones because they account for the sparse structure of the network architecture and the norms of the convolutional filters, rather than the norms of the (Toeplitz) matrices associated with the convolutional layers. Theoretically, we demonstrate that these bounds are significantly tighter than standard norm-based generalization bounds. Empirically, they offer relatively tight estimates of generalization on various simple classification problems. Collectively, these findings suggest that the sparsity of both the underlying target function and the model's architecture plays a crucial role in the success of deep learning.
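As a rough numerical illustration of the filter-norm versus matrix-norm gap described above (a sketch of my own, not the paper's construction; the input length n, filter length q, and random filter are assumed), the snippet below compares the Frobenius norm of a 1D convolutional filter with that of the circulant (Toeplitz-like) matrix that applies it at every shift:

```python
import numpy as np

rng = np.random.default_rng(0)
n, q = 256, 5                      # assumed input length and filter length
k = rng.normal(size=q)             # a random convolutional filter

# Build the n x n circulant matrix whose s-th row applies the filter at shift s.
row = np.zeros(n)
row[:q] = k
T = np.stack([np.roll(row, s) for s in range(n)])

print("filter norm ||k||_F:", np.linalg.norm(k))
print("matrix norm ||T||_F:", np.linalg.norm(T))   # equals sqrt(n) * ||k||_F
print("ratio:", np.linalg.norm(T) / np.linalg.norm(k), "~ sqrt(n) =", np.sqrt(n))
```

Since each of the n rows of T carries a copy of the filter, ||T||_F = sqrt(n) * ||k||_F, so a per-layer bound phrased in filter norms can beat one phrased in matrix norms by a factor that grows with the input size.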
Award ID(s): 2134108
PAR ID: 10565441
Author(s) / Creator(s): ; ; ;
Publisher / Repository: Advances in Neural Information Processing Systems
Date Published:
Volume: 36
Format(s): Medium: X
Location: New Orleans
Sponsoring Org: National Science Foundation
More Like this
  1. In this paper, we investigate the Rademacher complexity of deep sparse neural networks, where each neuron receives a small number of inputs. We prove generalization bounds for multilayered sparse ReLU neural networks, including convolutional neural networks. These bounds differ from previous ones, as they consider the norms of the convolutional filters instead of the norms of the associated Toeplitz matrices, independently of weight sharing between neurons. As we show theoretically, these bounds may be orders of magnitude better than standard norm-based generalization bounds; empirically, they are nearly non-vacuous when estimating generalization in various simple classification problems. Taken together, these results suggest that compositional sparsity of the underlying target function is critical to the success of deep neural networks.
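To make the "each neuron receives a small number of inputs" setting concrete, here is a toy NumPy construction (the widths, fan-in, and random wiring are assumptions for illustration, not the paper's architecture) of a multilayer sparse ReLU network:

```python
import numpy as np

rng = np.random.default_rng(1)

def sparse_layer(n_in, n_out, fan_in):
    """Weight matrix with exactly `fan_in` nonzero inputs per output neuron."""
    W = np.zeros((n_out, n_in))
    for i in range(n_out):
        idx = rng.choice(n_in, size=fan_in, replace=False)
        W[i, idx] = rng.normal(size=fan_in)
    return W

widths, fan_in = [64, 64, 64, 1], 4        # assumed layer widths and fan-in
layers = [sparse_layer(widths[i], widths[i + 1], fan_in)
          for i in range(len(widths) - 1)]

x = rng.normal(size=widths[0])
for W in layers[:-1]:
    x = np.maximum(W @ x, 0.0)             # ReLU
y = layers[-1] @ x

nnz = sum(int((W != 0).sum()) for W in layers)
print("output:", y, "| nonzero weights:", nnz, "of", sum(W.size for W in layers))
```

The norms entering such bounds are then taken over the few nonzero weights per neuron, independently of whether weights are shared between neurons (as in a convolution) or not.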
  2. We overview several properties, old and new, of training overparameterized deep networks under the square loss. We first consider a model of the dynamics of gradient flow under the square loss in deep homogeneous rectified linear unit networks. We study the convergence to a solution with the absolute minimum ρ, which is the product of the Frobenius norms of each layer weight matrix, when normalization by Lagrange multipliers is used together with weight decay under different forms of gradient descent. A main property of the minimizers that bounds their expected error for a specific network architecture is ρ. In particular, we derive novel norm-based bounds for convolutional layers that are orders of magnitude better than classical bounds for dense networks. Next, we prove that quasi-interpolating solutions obtained by stochastic gradient descent in the presence of weight decay have a bias toward low-rank weight matrices, which should improve generalization. The same analysis predicts the existence of an inherent stochastic gradient descent noise for deep networks. In both cases, we verify our predictions experimentally. We then predict neural collapse and its properties without any specific assumption, unlike other published proofs. Our analysis supports the idea that the advantage of deep networks relative to other classifiers is greater for problems that are appropriate for sparse deep architectures such as convolutional neural networks. The reason is that compositionally sparse target functions can be approximated well by “sparse” deep networks without incurring the curse of dimensionality.
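For concreteness, here is a minimal sketch of the quantity ρ discussed above, using made-up layer shapes: it computes the product of per-layer Frobenius norms and checks that, by the positive homogeneity of ReLU, the network equals ρ times its layer-normalized version:

```python
import numpy as np

rng = np.random.default_rng(2)
shapes = [(128, 784), (64, 128), (10, 64)]          # assumed layer shapes
weights = [rng.normal(size=s) / np.sqrt(s[1]) for s in shapes]

rho = 1.0
for W in weights:
    rho *= np.linalg.norm(W)                        # Frobenius norm per layer
print("rho =", rho)

def forward(ws, x):
    for W in ws[:-1]:
        x = np.maximum(W @ x, 0.0)                  # ReLU
    return ws[-1] @ x

# Rescaling each layer to unit Frobenius norm and multiplying the output by
# rho leaves the (positively homogeneous) network function unchanged.
x = rng.normal(size=784)
normalized = [W / np.linalg.norm(W) for W in weights]
print("factorization exact:", np.allclose(forward(weights, x), rho * forward(normalized, x)))
```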
  3. While previous optimization results have suggested that deep neural networks tend to favour low-rank weight matrices, the implications of this inductive bias on generalization bounds remain underexplored. In this paper, we apply a chain rule for Gaussian complexity (Maurer, 2016a) to analyze how low-rank layers in deep networks can prevent the accumulation of rank and dimensionality factors that typically multiply across layers. This approach yields generalization bounds for rank and spectral norm constrained networks. We compare our results to prior generalization bounds for deep networks, highlighting how deep networks with low-rank layers can achieve better generalization than those with full-rank layers. Additionally, we discuss how this framework provides new perspectives on the generalization capabilities of deep networks exhibiting neural collapse. Keywords: Gaussian complexity, Generalization bounds, Neural collapse, Low rank layers
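As an illustrative computation (the matrices and the use of stable rank as a proxy are my assumptions, not the paper's exact complexity measure), the snippet below contrasts the stable rank ||W||_F^2 / ||W||_2^2 of a generic layer with that of a low-rank layer; it is this kind of rank factor that stops multiplying across layers when the layers are low rank:

```python
import numpy as np

rng = np.random.default_rng(3)
d, r = 256, 4                                       # assumed dimension and rank

W_full = rng.normal(size=(d, d))                    # generic (full-rank) layer
W_low = rng.normal(size=(d, r)) @ rng.normal(size=(r, d))   # rank-r layer

def stable_rank(W):
    # Frobenius norm squared over spectral norm squared; at most rank(W).
    return np.linalg.norm(W) ** 2 / np.linalg.norm(W, 2) ** 2

print("stable rank, full-rank layer:", stable_rank(W_full))
print("stable rank, rank-4 layer   :", stable_rank(W_low))
```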
  4. Automated manufacturing feature recognition is a crucial link between computer-aided design and manufacturing, facilitating process selection and other downstream tasks in computer-aided process planning. While various methods such as graph-based, rule-based, and neural networks have been proposed for automatic feature recognition, they suffer from poor scalability or computational inefficiency. Recently, voxel-based convolutional neural networks have shown promise in solving these challenges but incur a tradeoff between computational cost and feature resolution. This paper investigates a computationally efficient sparse voxel-based convolutional neural network for manufacturing feature recognition, specifically, an octree-based sparse voxel convolutional neural network. This model is trained on a large-scale manufacturing feature dataset, and its performance is compared to a voxel-based feature recognition model (FeatureNet). The results indicate that the octree-based model yields higher feature recognition accuracy (99.5% on the test dataset) with 44% lower graphics processing unit (GPU) memory consumption than a voxel-based model of comparable resolution. In addition, increasing the resolution of the octree-based model enables recognition of finer manufacturing features. These results indicate that a sparse voxel-based convolutional neural network is a computationally efficient deep learning model for manufacturing feature recognition to enable process planning automation. Moreover, the sparse voxel-based neural network demonstrated comparable performance to a boundary representation-based feature recognition neural network, achieving similar accuracy in single-feature recognition without having access to the exact 3D shape descriptors.
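A back-of-the-envelope sketch (assumed resolution, with a thin spherical shell standing in for a part surface) of why sparse voxel and octree representations save memory: only a small fraction of the cells in a dense grid around a typical surface are occupied, and an octree refines only near those cells:

```python
import numpy as np

res = 64                                            # assumed grid resolution
ax = np.linspace(-1, 1, res)
X, Y, Z = np.meshgrid(ax, ax, ax, indexing="ij")
r = np.sqrt(X**2 + Y**2 + Z**2)

# A thin spherical shell as a stand-in for a CAD part's surface.
shell = np.abs(r - 0.8) < (2.0 / res)

dense_cells = res ** 3
occupied = int(shell.sum())
print(f"dense grid cells: {dense_cells}")
print(f"occupied cells  : {occupied} ({100 * occupied / dense_cells:.2f}%)")
# An octree's leaf count scales roughly with the occupied cells, not res**3,
# which is the source of the reported GPU memory savings.
```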