Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation

Belkin, Mikhail

doi:10.1017/S0962492921000039

Citation Details

Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation

In the past decade the mathematical theory of machine learning has lagged far behind the triumphs of deep neural networks on practical challenges. However, the gap between theory and practice is gradually starting to close. In this paper I will attempt to assemble some pieces of the remarkable and still incomplete mathematical mosaic emerging from the efforts to understand the foundations of deep learning. The two key themes will be interpolation and its sibling over-parametrization. Interpolation corresponds to fitting data, even noisy data, exactly. Over-parametrization enables interpolation and provides flexibility to select a suitable interpolating model. As we will see, just as a physical prism separates colours mixed within a ray of light, the figurative prism of interpolation helps to disentangle generalization and optimization properties within the complex picture of modern machine learning. This article is written in the belief and hope that clearer understanding of these issues will bring us a step closer towards a general theory of deep learning and machine learning. more »

Award ID(s):: 2050360

PAR ID:: 10294897

Author(s) / Creator(s):: Belkin, Mikhail

Date Published:: 2021-05-01

Journal Name:: Acta Numerica

Volume:: 30

ISSN:: 0962-4929

Page Range / eLocation ID:: 203 to 248

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1017/S0962492921000039

More Like this