Kernel-based learning algorithms have been extensively studied over the past two decades for their successful applications in scientific research and industrial problem-solving. In classical kernel methods, such as kernel ridge regression and support vector machines, an unregularized offset term naturally appears. While the offset is clearly useful in some situations, its value is debatable in others. It is commonly agreed, however, that the offset term introduces substantial challenges to the optimization and theoretical analysis of these algorithms. In this paper, we demonstrate that Kernel Ridge Regression (KRR) with an offset is closely connected to regularization schemes involving centered reproducing kernels. With the aid of this connection and the theory of centered reproducing kernels, we establish generalization error bounds for KRR with an offset, which show that the algorithm can achieve minimax optimal rates.
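As a concrete illustration of the centering connection described in this abstract, the following minimal sketch fits KRR with an unregularized offset by centering the kernel matrix and the labels. The Gaussian kernel choice and all names (`rbf_kernel`, `fit_krr_offset`) are illustrative assumptions, not code or notation from the paper.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    """Gaussian kernel matrix K[i, j] = exp(-gamma * ||x_i - z_j||^2)."""
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def fit_krr_offset(X, y, lam=1e-2, gamma=1.0):
    """KRR with an unregularized intercept:
    min_{alpha, b} (1/n) ||y - K alpha - b 1||^2 + lam * alpha^T K alpha.
    Centering K and y reduces this to an ordinary KRR with a centered kernel."""
    n = len(y)
    K = rbf_kernel(X, X, gamma)
    H = np.eye(n) - np.ones((n, n)) / n              # centering matrix
    Kc = H @ K @ H                                   # centered kernel matrix
    alpha = np.linalg.solve(Kc + n * lam * np.eye(n), y - y.mean())
    b = float(np.mean(y - K @ alpha))                # optimal offset given alpha
    return alpha, b

def predict(X_train, X_test, alpha, b, gamma=1.0):
    return rbf_kernel(X_test, X_train, gamma) @ alpha + b
```

Eliminating the offset via its first-order condition leaves exactly the centered system solved above, which is the kind of connection between the offset problem and centered-kernel regularization that the abstract refers to.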
Overparameterized Random Feature Regression with Nearly Orthogonal Data
We investigate the properties of random feature ridge regression (RFRR) given by a two-layer neural network with random Gaussian initialization. We study the non-asymptotic behavior of RFRR with nearly orthogonal deterministic unit-length input data vectors in the overparameterized regime, where the width of the first layer is much larger than the sample size. Our analysis shows high-probability non-asymptotic concentration results for the training errors, cross-validation errors, and generalization errors of RFRR centered around their respective values for a kernel ridge regression (KRR). This KRR is derived from an expected kernel generated by a nonlinear random feature map. We then approximate the performance of the KRR by a polynomial kernel matrix obtained from the Hermite polynomial expansion of the activation function, whose degree only depends on the orthogonality among different data points. This polynomial kernel determines the asymptotic behavior of the RFRR and the KRR. Our results hold for a wide variety of activation functions and input data sets that exhibit nearly orthogonal properties. Based on these approximations, we obtain a lower bound for the generalization error of the RFRR for a nonlinear student-teacher model.
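The comparison at the heart of this abstract, RFRR concentrating around a KRR with the expected kernel, can be illustrated numerically. The sketch below assumes a ReLU activation and unit-length inputs, for which the expected kernel has the closed arc-cosine form; the function names and hyperparameters are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def relu_features(X, W):
    """phi(x)_j = relu(w_j . x) / sqrt(N): hidden layer of a width-N two-layer net."""
    return np.maximum(X @ W.T, 0.0) / np.sqrt(W.shape[0])

def rfrr_predict(X_tr, y_tr, X_te, N=8192, lam=1e-3, seed=0):
    """Random feature ridge regression in the overparameterized regime (N >> n)."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((N, X_tr.shape[1]))      # random Gaussian first layer
    Phi_tr, Phi_te = relu_features(X_tr, W), relu_features(X_te, W)
    n = len(y_tr)
    # Dual form avoids the N x N solve: theta = Phi^T (Phi Phi^T + lam I)^{-1} y
    alpha = np.linalg.solve(Phi_tr @ Phi_tr.T + lam * np.eye(n), y_tr)
    return Phi_te @ (Phi_tr.T @ alpha)

def expected_relu_kernel(X, Z):
    """E_w[relu(w.x) relu(w.z)] for w ~ N(0, I) and unit-length x, z
    (the order-1 arc-cosine kernel)."""
    G = np.clip(X @ Z.T, -1.0, 1.0)                  # cosines for unit-length rows
    theta = np.arccos(G)
    return (np.sin(theta) + (np.pi - theta) * G) / (2.0 * np.pi)

def krr_predict(X_tr, y_tr, X_te, lam=1e-3):
    """KRR with the expected kernel, around which RFRR concentrates."""
    K = expected_relu_kernel(X_tr, X_tr)
    alpha = np.linalg.solve(K + lam * np.eye(len(y_tr)), y_tr)
    return expected_relu_kernel(X_te, X_tr) @ alpha
```

On nearly orthogonal unit-length inputs, the two predictors returned by `rfrr_predict` and `krr_predict` should be close once the width N is much larger than the sample size, which is the concentration phenomenon the paper quantifies.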
- Award ID(s): 2154099
- PAR ID: 10540436
- Editor(s): Ruiz, Francisco; Dy, Jennifer; van de Meent, Jan-Willem
- Publisher / Repository: Proceedings of Machine Learning Research
- Date Published:
- Volume: 206
- ISSN: 2640-3498
- Page Range / eLocation ID: 8463-8493
- Format(s): Medium: X
- Location: Valencia, Spain
- Sponsoring Org: National Science Foundation
More Like this
- Nyström approximation is a fast randomized method that rapidly solves kernel ridge regression (KRR) problems through sub-sampling the n-by-n empirical kernel matrix appearing in the objective function. However, the performance of such a sub-sampling method heavily relies on correctly estimating the statistical leverage scores used to form the sampling distribution, which can be as costly as solving the original KRR. In this work, we propose a linear-time (modulo poly-log terms) algorithm that accurately approximates the statistical leverage scores in stationary-kernel-based KRR with theoretical guarantees. In particular, by analyzing the first-order condition of the KRR objective, we derive an analytic formula, depending on both the input distribution and the spectral density of the stationary kernel, that captures the non-uniformity of the statistical leverage scores. Numerical experiments demonstrate that, at the same prediction accuracy, our method is orders of magnitude more efficient than existing methods at selecting representative sub-samples in the Nyström approximation. (A generic sketch of Nyström KRR with leverage-score sampling follows this list.)
- Tuning parameter selection is of critical importance for kernel ridge regression. To date, a data-driven tuning method for divide-and-conquer kernel ridge regression (d-KRR) has been lacking in the literature, which limits the applicability of d-KRR to large datasets. In this article, by modifying the generalized cross-validation (GCV) score, we propose a distributed generalized cross-validation (dGCV) as a data-driven tool for selecting the tuning parameters in d-KRR. Not only is the proposed dGCV computationally scalable for massive datasets, it is also shown, under mild conditions, to be asymptotically optimal in the sense that minimizing the dGCV score is equivalent to minimizing the true global conditional empirical loss of the averaged function estimator, extending the existing optimality results of GCV to the divide-and-conquer framework. Supplemental materials for this article are available online. (A pooled GCV-style sketch for divide-and-conquer KRR follows this list.)
- We study the optimization of wide neural networks (NNs) via gradient flow (GF) in setups that allow feature learning while admitting non-asymptotic global convergence guarantees. First, for wide shallow NNs under the mean-field scaling and with a general class of activation functions, we prove that when the input dimension is no less than the size of the training set, the training loss converges to zero at a linear rate under GF. Building upon this analysis, we study a model of wide multi-layer NNs whose second-to-last layer is trained via GF, for which we also prove a linear-rate convergence of the training loss to zero, but regardless of the input dimension. We also show empirically that, unlike in the Neural Tangent Kernel (NTK) regime, our multi-layer model exhibits feature learning and can achieve better generalization performance than its NTK counterpart. (A toy mean-field gradient-descent sketch follows this list.)
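For the Nyström entry above, here is a generic sketch of Nyström-restricted KRR with ridge-leverage-score sampling. It computes exact leverage scores from the full kernel matrix purely for illustration; that work's contribution is precisely a near-linear-time approximation of these scores for stationary kernels, which is not reproduced here. The Gaussian kernel and all function names are assumptions.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def ridge_leverage_scores(K, lam):
    """l_i = [K (K + n*lam*I)^{-1}]_{ii}: the non-uniform importance of each sample."""
    n = K.shape[0]
    return np.diag(K @ np.linalg.inv(K + n * lam * np.eye(n)))

def nystrom_krr(X, y, m=100, lam=1e-2, gamma=1.0, seed=0):
    """Restrict f to the span of m sampled landmarks and solve the reduced KRR."""
    rng = np.random.default_rng(seed)
    n = len(y)
    K = rbf_kernel(X, X, gamma)                      # full kernel, for illustration only
    p = ridge_leverage_scores(K, lam)
    idx = rng.choice(n, size=m, replace=False, p=p / p.sum())
    K_nm, K_mm = K[:, idx], K[np.ix_(idx, idx)]
    # Normal equations of (1/n)||y - K_nm beta||^2 + lam * beta^T K_mm beta
    A = K_nm.T @ K_nm + n * lam * K_mm + 1e-10 * np.eye(m)
    beta = np.linalg.solve(A, K_nm.T @ y)
    return idx, beta                                 # f(x) = sum_j beta_j k(x, x_idx[j])
```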
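For the divide-and-conquer KRR entry, the sketch below fits KRR independently on each data block and scores a candidate regularization parameter with a pooled GCV-type criterion. The exact dGCV score is defined in that article; the aggregation used here (pooled residual sum of squares over a pooled degrees-of-freedom correction) and the function names are assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def gcv_style_score(blocks, lam, gamma=1.0):
    """blocks: list of (X_j, y_j) partitions. Pool per-block residuals and
    effective degrees of freedom into a single GCV-type score for this lam."""
    n_total, rss, df = 0, 0.0, 0.0
    for X_j, y_j in blocks:
        n_j = len(y_j)
        K = rbf_kernel(X_j, X_j, gamma)
        A = K @ np.linalg.inv(K + n_j * lam * np.eye(n_j))   # per-block smoother matrix
        r = y_j - A @ y_j
        rss += float(r @ r)
        df += float(np.trace(A))
        n_total += n_j
    return (rss / n_total) / (1.0 - df / (len(blocks) * n_total)) ** 2

def select_lambda(blocks, lam_grid, gamma=1.0):
    """Pick the lambda minimizing the pooled score; each block's KRR is then refit
    with it and the block estimators averaged (divide-and-conquer KRR)."""
    scores = [gcv_style_score(blocks, lam, gamma) for lam in lam_grid]
    return lam_grid[int(np.argmin(scores))]
```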
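For the last entry, here is a toy mean-field-scaled shallow network trained by full-batch gradient descent (a discrete-time stand-in for gradient flow), tracking the squared training loss whose linear-rate decay that work analyzes. The tanh activation, the step-size scaling by the width m, and all names are assumptions for illustration, not the paper's setup.

```python
import numpy as np

def forward(X, W, a):
    """Mean-field-scaled shallow net: f(x) = (1/m) * sum_j a_j * tanh(w_j . x)."""
    return np.tanh(X @ W.T) @ a / W.shape[0]

def train_shallow(X, y, m=2048, lr=0.2, steps=2000, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W, a = rng.standard_normal((m, d)), rng.standard_normal(m)
    losses = []
    for _ in range(steps):
        H = np.tanh(X @ W.T)                         # n x m hidden activations
        r = H @ a / m - y                            # residuals of the averaged output
        losses.append(float(r @ r / n))
        grad_a = 2.0 / (m * n) * (H.T @ r)
        grad_W = 2.0 / (m * n) * a[:, None] * (((1.0 - H**2) * r[:, None]).T @ X)
        # Per-particle updates; scaling the step by m is a common mean-field
        # time parameterization (a modeling choice here, not taken from the paper).
        a -= lr * m * grad_a
        W -= lr * m * grad_W
    return W, a, losses
```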