Understanding Deflation Process in Over-parametrized Tensor Decomposition

Ge, R; Ren, Y.; Wang, X.; Zhou, M.

Citation Details

In this paper we study the training dynamics for gradient flow on overparametrized tensor decomposition problems. Empirically, such training process often first fits larger components and then discovers smaller components, which is similar to a tensor deflation process that is commonly used in tensor decomposition algorithms. We prove that for orthogonally decomposable tensor, a slightly modified version of gradient flow would follow a tensor deflation process and recover all the tensor components. Our proof suggests that for orthogonal tensors, gradient flow dynamics works similarly as greedy low-rank learning in the matrix setting, which is a first step towards understanding the implicit regularization effect of over-parametrized models for low-rank tensors. more »

Award ID(s):: 1845171 1704656 2031849

NSF-PAR ID:: 10335913

Author(s) / Creator(s):: Ge, R; Ren, Y.; Wang, X.; Zhou, M.

Date Published:: 2021-01-01

Journal Name:: Thirty-fifth Conference on Neural Information Processing Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this