End-to-end Learning of a Convolutional Neural Network via Deep Tensor Decomposition.

Oymak, Samet; Soltanolkotabi, Mahdi

Citation Details

In this paper we study the problem of learning the weights of a deep convolutional neural network. We consider a network where convolutions are carried out over non-overlapping patches with a single kernel in each layer. We develop an algorithm for simultaneously learning all the kernels from the training data. Our approach dubbed Deep Tensor Decomposition (DeepTD1 ) is based on a rank-1 tensor decomposition. We theoretically investigate DeepTD under a realizable model for the training data where the inputs are chosen i.i.d. from a Gaussian distribution and the labels are generated according to planted convolutional kernels. We show that DeepTD is data-efficient and provably works as soon as the sample size exceeds the total number of convolutional weights in the network. We carry out a variety of numerical experiments to investigate the effectiveness of DeepTD and verify our theoretical findings. more »

Award ID(s):: 1846369 1813877

PAR ID:: 10316298

Author(s) / Creator(s):: Oymak, Samet; Soltanolkotabi, Mahdi

Date Published:: 2021-01-01

Journal Name:: Information and inference

ISSN:: 2049-8764

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this