Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks

Zhang, Zixuan; Zhang, Kaiqi; Chen, Minshuo; Takeda, Yuma; Wang, Mengdi; Zhao, Tuo; Wang, Yu-Xiang

Citation Details

Convolutional residual neural networks (ConvResNets), though overparameterized, can achieve remarkable prediction performance in practice, which cannot be well explained by conventional wisdom. To bridge this gap, we study the performance of ConvResNeXts, which cover ConvResNets as a special case, trained with weight decay from the perspective of nonparametric classification. Our analysis allows for infinitely many building blocks in ConvResNeXts, and shows that weight decay implicitly enforces sparsity on these blocks. Specifically, we consider a smooth target function supported on a low-dimensional manifold, then prove that ConvResNeXts can adapt to the function smoothness and low-dimensional structures and efficiently learn the function without suffering from the curse of dimensionality. Our findings partially justify the advantage of overparameterized ConvResNeXts over conventional machine learning models.

Award ID(s): 2134214, 2536920

PAR ID: 10550187

Author(s) / Creator(s): Zhang, Zixuan; Zhang, Kaiqi; Chen, Minshuo; Takeda, Yuma; Wang, Mengdi; Zhao, Tuo; Wang, Yu-Xiang

Publisher / Repository: Advances in neural information processing systems (NeurIPS'2024)

Date Published: 2024-10-22

Journal Name: Advances in neural information processing systems

ISSN: 1049-5258

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
The DOI is not currently available.
