Convex Optimization for Shallow Neural Networks

Ergen, Tolga; Pilanci, Mert

doi:10.1109/ALLERTON.2019.8919769

Citation Details

Convex Optimization for Shallow Neural Networks

We consider non-convex training of shallow neural networks and introduce a convex relaxation approach with theoretical guarantees. For the single neuron case, we prove that the relaxation preserves the location of the global minimum under a planted model assumption. Therefore, a globally optimal solution can be efficiently found via a gradient method. We show that gradient descent applied on the relaxation always outperforms gradient descent on the original non-convex loss with no additional computational cost. We then characterize this relaxation as a regularizer and further introduce extensions to multineuron single hidden layer networks. more »

Award ID(s):: 1838179

PAR ID:: 10128378

Author(s) / Creator(s):: Ergen, Tolga; Pilanci, Mert

Date Published:: 2019-09-01

Journal Name:: 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

Page Range / eLocation ID:: 79 to 83

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ALLERTON.2019.8919769

More Like this