Stable tensor neural networks for efficient deep learning

Newman, Elizabeth; Horesh, Lior; Avron, Haim; Kilmer, Misha E

doi:10.3389/fdata.2024.1363978

Learning from complex, multidimensional data has become central to computational mathematics, and among the most successful high-dimensional function approximators are deep neural networks (DNNs). Training DNNs is posed as an optimization problem to learn network weights or parameters that well-approximate a mapping from input to target data. Multiway data or tensors arise naturally in myriad ways in deep learning, in particular as input data and as high-dimensional weights and features extracted by the network, with the latter often being a bottleneck in terms of speed and memory. In this work, we leverage tensor representations and processing to efficiently parameterize DNNs when learning from high-dimensional data. We propose tensor neural networks (t-NNs), a natural extension of traditional fully-connected networks, that can be trained efficiently in a reduced, yet more powerful parameter space. Our t-NNs are built upon matrix-mimetic tensor-tensor products, which retain algebraic properties of matrix multiplication while capturing high-dimensional correlations. Mimeticity enables t-NNs to inherit desirable properties of modern DNN architectures. We exemplify this by extending recent work on stable neural networks, which interpret DNNs as discretizations of differential equations, to our multidimensional framework. We provide empirical evidence of the parametric advantages of t-NNs on dimensionality reduction using autoencoders and classification using fully-connected and stable variants on benchmark imaging datasets MNIST and CIFAR-10.
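To make the idea of a matrix-mimetic tensor-tensor product concrete, here is a minimal NumPy sketch of one well-known product of this kind, the t-product, which multiplies frontal slices in the Fourier domain along the third mode. The abstract does not say which product or tensor layout the paper uses, so the choice of the t-product, the function names `t_product` and `t_layer`, the bias term, and the ReLU activation are all assumptions made for illustration only.

```python
import numpy as np


def t_product(A, B):
    """t-product of A (m x p x n) with B (p x q x n), giving an (m x q x n) tensor.

    Illustrative sketch: transform along the third mode with the FFT,
    multiply matching frontal slices as ordinary matrices, transform back.
    """
    if A.shape[1] != B.shape[0] or A.shape[2] != B.shape[2]:
        raise ValueError("incompatible shapes for the t-product")
    A_hat = np.fft.fft(A, axis=2)
    B_hat = np.fft.fft(B, axis=2)
    # Slice-wise matrix products: C_hat[:, :, k] = A_hat[:, :, k] @ B_hat[:, :, k]
    C_hat = np.einsum("ipk,pjk->ijk", A_hat, B_hat)
    # For real inputs the result is real up to round-off, so drop the imaginary part.
    return np.real(np.fft.ifft(C_hat, axis=2))


def t_layer(W, X, b):
    """Hypothetical tensor layer: ReLU(W * X + b), with * the t-product above."""
    return np.maximum(t_product(W, X) + b, 0.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((3, 4, 5))   # weight tensor (assumed layout)
    X = rng.standard_normal((4, 2, 5))   # two samples stored as lateral slices
    b = rng.standard_normal((3, 1, 5))   # bias, broadcast over the samples
    print(t_layer(W, X, b).shape)
```

When the third dimension has size 1, this product reduces to ordinary matrix multiplication, which is the sense in which such products mimic matrix algebra.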

Award ID(s): 2309751

PAR ID: 10533088

Author(s) / Creator(s): Newman, Elizabeth; Horesh, Lior; Avron, Haim; Kilmer, Misha E

Editor(s): Zhang, Yanqing

Publisher / Repository: Frontiers

Date Published: 2024-05-30

Journal Name: Frontiers in Big Data

Volume: 7

ISSN: 2624-909X

Subject(s) / Keyword(s): tensor algebra, deep learning, machine learning, image classification, inverse problems

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.3389/fdata.2024.1363978
