Geometry of Optimization and Implicit Regularization in Deep Learning

Neyshabur, Behnam; Tomioka, Ryota; Salakhutdinov, Ruslan; Srebro, Nathan

Citation Details

We argue that optimization plays a crucial role in the generalization of deep learning models through implicit regularization. We do this by demonstrating that generalization ability is not controlled by network size but rather by some other implicit control. We then demonstrate how changing the empirical optimization procedure can improve generalization, even if actual optimization quality is not affected. We do so by studying the geometry of the parameter space of deep networks, and devising an optimization algorithm attuned to this geometry.

Award ID(s): 1302662

PAR ID: 10025956

Author(s) / Creator(s): Neyshabur, Behnam; Tomioka, Ryota; Salakhutdinov, Ruslan; Srebro, Nathan

Date Published: 2017-05-08

Journal Name: arXiv.org

ISSN: 2331-8422

Page Range / eLocation ID: arXiv:1705.03071v1 [cs.LG]

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Journal Article: The DOI is not currently available.
