Title: Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods
Autoencoders are a popular model in many branches of machine learning and lossy data compression. However, their fundamental limits, the performance of gradient methods and the features learnt during optimization remain poorly understood, even in the two-layer setting. In fact, earlier work has considered either linear autoencoders or specific training regimes (leading to vanishing or diverging compression rates). Our paper addresses this gap by focusing on non-linear two-layer autoencoders trained in the challenging proportional regime in which the input dimension scales linearly with the size of the representation. Our results characterize the minimizers of the population risk, and show that such minimizers are achieved by gradient methods; their structure is also unveiled, thus leading to a concise description of the features obtained via training. For the special case of a sign activation function, our analysis establishes the fundamental limits for the lossy compression of Gaussian sources via (shallow) autoencoders. Finally, while the results are proved for Gaussian data, numerical simulations on standard datasets display the universality of the theoretical predictions.
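As a rough, hedged illustration of the setting (not the paper's code), the sketch below builds a two-layer autoencoder with a sign activation in the proportional regime, where the representation size m = rd scales linearly with the input dimension d; all names and the rate r are illustrative assumptions.

```python
import numpy as np

# Minimal sketch (assumptions, not the paper's implementation): a shallow
# autoencoder x -> V * sign(W x) compressing a Gaussian source in the
# proportional regime, where the representation size m scales as m = r * d.
rng = np.random.default_rng(0)
d = 512                  # input dimension
r = 0.5                  # compression rate m / d (illustrative value)
m = int(r * d)           # size of the representation

W = rng.standard_normal((m, d)) / np.sqrt(d)   # encoder weights
V = rng.standard_normal((d, m)) / np.sqrt(m)   # decoder weights

def encode(x):
    # Sign activation: each latent coordinate is one bit, so the encoder
    # acts as a lossy compressor of the Gaussian source at rate m/d.
    return np.sign(W @ x)

def decode(z):
    return V @ z

x = rng.standard_normal(d)            # Gaussian input
x_hat = decode(encode(x))
print("per-coordinate MSE:", np.mean((x - x_hat) ** 2))
```

With random (untrained) weights this only fixes the architecture; the paper's contribution is to characterize the weights minimizing the population risk and to show that gradient methods reach them.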
Award ID(s):
1910056
PAR ID:
10490384
Publisher / Repository:
Proceedings of Machine Learning Research (PMLR)
Journal Name:
Proceedings of the 40th International Conference on Machine Learning
Format(s):
Medium: X
Location:
Hawaii
Sponsoring Org:
National Science Foundation
More Like this
  1. Deep neural networks (DNNs) are becoming deeper, wider, and more non-linear due to the growing demands on prediction accuracy and analysis quality. Training wide and deep neural networks requires large amounts of storage resources such as memory, because the intermediate activation data must be saved in memory during forward propagation and then restored for backward propagation. However, state-of-the-art accelerators such as GPUs are equipped with only very limited memory capacities due to hardware design constraints, which significantly limits the maximum batch size and hence the performance speedup when training large-scale DNNs. Traditional memory-saving techniques either suffer from performance overhead or are constrained by limited interconnect bandwidth or specific interconnect technology. In this paper, we propose a novel memory-efficient CNN training framework (called COMET) that leverages error-bounded lossy compression to significantly reduce the memory requirement for training, in order to allow training larger models or to accelerate training. Our framework purposely adopts error-bounded lossy compression with a strict error-controlling mechanism. Specifically, we perform a theoretical analysis of the compression error propagation from the altered activation data to the gradients, and empirically investigate the impact of altered gradients over the training process. Based on these analyses, we optimize the error-bounded lossy compression and propose an adaptive error-bound control scheme for activation data compression. Experiments demonstrate that our proposed framework can reduce training memory consumption by up to 13.5X over baseline training and by up to 1.8X over another state-of-the-art compression-based framework, with little or no accuracy loss.
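A hedged sketch of the core mechanism described above, using plain uniform quantization as a stand-in for a real error-bounded compressor such as SZ; the function names and the fixed error bound are illustrative assumptions (COMET adapts the bound during training).

```python
import numpy as np

def compress_activation(a, eb):
    # Uniform quantization with bin width 2*eb guarantees that the
    # element-wise reconstruction error is at most eb. A real compressor
    # would further entropy-code the integer codes q.
    return np.round(a / (2 * eb)).astype(np.int16)

def decompress_activation(q, eb):
    return q.astype(np.float32) * (2 * eb)

rng = np.random.default_rng(0)
act = rng.standard_normal((64, 256)).astype(np.float32)  # forward-pass activation
eb = 1e-2                                                # fixed here; adaptive in COMET

q = compress_activation(act, eb)              # keep only the compact form in memory
act_restored = decompress_activation(q, eb)   # restore for backward propagation

assert np.max(np.abs(act - act_restored)) <= eb + 1e-7   # strict error control
print(f"max error {np.max(np.abs(act - act_restored)):.4f}, "
      f"memory ratio {act.nbytes / q.nbytes:.1f}x")
```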
  2. Lossy compressors are increasingly adopted in scientific research, tackling volumes of data from experiments or parallel numerical simulations and facilitating data storage and movement. In contrast with the notion of entropy in lossless compression, no theoretical or data-based quantification of lossy compressibility exists for scientific data, so users rely on trial and error to assess lossy compression performance. As a strong data-driven effort toward quantifying the lossy compressibility of scientific datasets, we provide a statistical framework to predict the compression ratios of lossy compressors. Our method is a two-step framework where (i) compressor-agnostic predictors are computed and (ii) statistical prediction models relying on these predictors are trained on observed compression ratios. The proposed predictors exploit spatial correlations and notions of entropy and lossiness via the quantized entropy. We study 8+ compressors on 6 scientific datasets and achieve a median percentage prediction error of less than 12%, which is substantially smaller than that of other methods, while achieving at least an 8.8× speedup when searching for a specific compression ratio and a 7.8× speedup when determining the best compressor from a collection.
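The two-step framework might look roughly like the following sketch; the predictor definitions (a quantized entropy and a lag-1 correlation) and the synthetic training data are simplified assumptions, not the paper's exact features.

```python
import numpy as np

# Step (i): compressor-agnostic predictors (illustrative stand-ins).
def quantized_entropy(data, eb):
    # Entropy of the data after quantization at error bound eb,
    # a proxy for lossiness-aware compressibility.
    q = np.round(data / (2 * eb))
    _, counts = np.unique(q, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def lag1_correlation(data):
    # Simple spatial-correlation predictor over the flattened field.
    flat = data.reshape(-1)
    return np.corrcoef(flat[:-1], flat[1:])[0, 1]

# Step (ii): regress observed compression ratios on the predictors.
# Compression ratios below are synthetic, for illustration only.
rng = np.random.default_rng(0)
fields = [rng.standard_normal((64, 64)) for _ in range(20)]
X = np.array([[quantized_entropy(f, 1e-3), lag1_correlation(f)] for f in fields])
y = 5.0 - 0.4 * X[:, 0] + 2.0 * X[:, 1] + 0.1 * rng.standard_normal(20)

coef, *_ = np.linalg.lstsq(np.c_[np.ones(len(y)), X], y, rcond=None)
print("fitted model (intercept, entropy, correlation):", coef)
```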
  3. In many real-world applications (e.g., monitoring of individual health, climate, brain activity, and environmental exposures), the data of interest change smoothly over a continuum such as time, yielding multi-dimensional functional data. Solving clustering, classification, and regression problems with functional data calls for effective methods for learning compact representations of functional data. Existing methods for representation learning from functional data, e.g., functional principal component analysis, are generally limited to learning linear mappings from the data space to the representation space. However, in many applications such linear methods do not suffice. Hence, we study the novel problem of learning non-linear representations of functional data. Specifically, we propose functional autoencoders, which generalize neural network autoencoders so as to learn non-linear representations of functional data. We derive, from first principles, a functional gradient-based algorithm for training functional autoencoders. We present results of experiments that demonstrate that functional autoencoders outperform state-of-the-art baseline methods.
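A minimal sketch, under the assumption that the input functions are observed on a common grid: the encoder replaces the rows of a weight matrix with discretized weight functions and approximates functional inner products by quadrature. The weights here are random and untrained; all names are illustrative.

```python
import numpy as np

# Sketch of a functional autoencoder (assumptions, not the paper's code):
# each input is a function x(t) sampled on a grid t_1..t_T, and the first
# layer computes quadrature approximations of <w_j, x> = ∫ w_j(t) x(t) dt,
# generalizing the matrix product of a standard autoencoder.
rng = np.random.default_rng(0)
T, k, n = 100, 5, 32                    # grid size, latent dim, batch size
t = np.linspace(0.0, 1.0, T)
dt = t[1] - t[0]

W_enc = rng.standard_normal((k, T)) * 0.1   # discretized weight functions
W_dec = rng.standard_normal((T, k)) * 0.1

def encode(X):
    # <w_j, x> ≈ sum_i w_j(t_i) x(t_i) dt, followed by a non-linearity.
    return np.tanh(X @ (W_enc.T * dt))

def decode(Z):
    return Z @ W_dec.T

X = np.sin(2 * np.pi * np.outer(rng.uniform(1, 3, n), t))  # functional data
X_hat = decode(encode(X))
print("reconstruction MSE (untrained):", np.mean((X - X_hat) ** 2))
```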
  4. The objective of this work is to develop error-bounded lossy compression methods that preserve topological features in 2D and 3D vector fields. Specifically, we explore the preservation of critical points in piecewise linear and bilinear vector fields. We define the preservation of critical points as (1) keeping each critical point in its original cell and (2) retaining the type of each critical point (e.g., saddle or attracting node), without any false positives, false negatives, or false types in the decompressed data. The key to our method is to adapt a vertex-wise error bound for each grid point and to compress the input data together with the error bound field using a modified lossy compressor. Our compression algorithm can also be embarrassingly parallelized for large-data handling and in situ processing. We benchmark our method by comparing it with existing lossy compressors in terms of false positive/negative/type rates, compression ratio, and various vector field visualizations in several scientific applications.
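The acceptance test might be sketched as below: flag grid cells whose corner values admit a sign change in both vector components (a coarse stand-in for the paper's piecewise linear/bilinear critical point extraction), then compare the flags before and after compression to count false positives and negatives. All details are illustrative assumptions.

```python
import numpy as np

# Sketch: detect candidate critical cells of a 2D vector field (u, v) on a
# grid, and verify that decompression created no false positives/negatives.
def critical_cells(u, v):
    # Stack the four corner values of every grid cell.
    cu = np.stack([u[:-1, :-1], u[1:, :-1], u[:-1, 1:], u[1:, 1:]])
    cv = np.stack([v[:-1, :-1], v[1:, :-1], v[:-1, 1:], v[1:, 1:]])
    sign_change = lambda c: (c.min(axis=0) < 0) & (c.max(axis=0) > 0)
    # A cell may contain a critical point only if both components vanish.
    return sign_change(cu) & sign_change(cv)

rng = np.random.default_rng(0)
u, v = rng.standard_normal((2, 65, 65))          # toy vector field components
eb = 1e-3                                        # illustrative error bound
u_dec = u + rng.uniform(-eb, eb, u.shape)        # stand-in for decompressed data
v_dec = v + rng.uniform(-eb, eb, v.shape)

before, after = critical_cells(u, v), critical_cells(u_dec, v_dec)
print("false positives:", int((after & ~before).sum()),
      "| false negatives:", int((before & ~after).sum()))
```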
  5. As the scale and complexity of high-performance computing (HPC) systems keep growing, data compression techniques are often adopted to reduce data volume and processing time. While lossy compression is preferable to lossless compression because of its potential for high compression ratios, it is not worth the effort unless an optimal balance between volume reduction and information loss is found. Among the many lossy compression techniques, transform-based lossy algorithms utilize spatial redundancy better. However, transform-based lossy compressors have received relatively little attention because their compression performance on scientific datasets is not well understood. The insight of this paper is that, in transform-based lossy compressors, quantifying the dominant coefficients at the block level reveals the right balance, potentially impacting overall compression ratios. Motivated by this, we characterize three transform-based lossy compression mechanisms with different information compaction methods, using statistical features that capture the data characteristics. We then build several prediction models using these statistical features and the characteristics of the dominant coefficients, and evaluate the effectiveness of each model using six HPC datasets from three production-level simulations at scale. Our results demonstrate that a random forest classifier captures the behavior of dominant coefficients precisely, achieving nearly 99% prediction accuracy.
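An illustrative sketch of the modeling step, with synthetic blocks, stand-in statistical features, and a "dominance" label derived from a simple FFT proxy for the transform stage; none of these definitions are the paper's.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Sketch (synthetic stand-ins, not the paper's features): predict whether a
# block's transform coefficients are dominated by a few large values, using
# block-level statistics as inputs to a random forest classifier.
rng = np.random.default_rng(0)

def dominance(block):
    c = np.abs(np.fft.rfft(block.ravel()))     # proxy for a transform stage
    return c.max() / (c.sum() + 1e-12)         # share of the top coefficient

def features(block):
    flat = block.ravel()
    return [flat.std(),                               # spread
            np.abs(np.diff(flat)).mean(),             # local roughness
            np.corrcoef(flat[:-1], flat[1:])[0, 1]]   # lag-1 correlation

smooth = rng.standard_normal((250, 16, 16)).cumsum(axis=1).cumsum(axis=2)
rough = rng.standard_normal((250, 16, 16))
blocks = np.concatenate([smooth, rough])

X = np.array([features(b) for b in blocks])
dom = np.array([dominance(b) for b in blocks])
y = (dom > np.median(dom)).astype(int)          # synthetic "dominant" label

perm = rng.permutation(len(blocks))             # shuffle before splitting
X, y = X[perm], y[perm]
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X[:400], y[:400])
print("held-out accuracy:", clf.score(X[400:], y[400:]))
```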