Learning Fixed Points of Recurrent Neural Networks by Reparameterizing the Network Model

Zhu, Vicky; Rosenbaum, Robert

doi:10.1162/neco_a_01681

Citation Details

Learning Fixed Points of Recurrent Neural Networks by Reparameterizing the Network Model

In computational neuroscience, recurrent neural networks are widely used to model neural activity and learning. In many studies, fixed points of recurrent neural networks are used to model neural responses to static or slowly changing stimuli, such as visual cortical responses to static visual stimuli. These applications raise the question of how to train the weights in a recurrent neural network to minimize a loss function evaluated on fixed points. In parallel, training fixed points is a central topic in the study of deep equilibrium models in machine learning. A natural approach is to use gradient descent on the Euclidean space of weights. We show that this approach can lead to poor learning performance due in part to singularities that arise in the loss surface. We use a reparameterization of the recurrent network model to derive two alternative learning rules that produce more robust learning dynamics. We demonstrate that these learning rules avoid singularities and learn more effectively than standard gradient descent. The new learning rules can be interpreted as steepest descent and gradient descent, respectively, under a non-Euclidean metric on the space of recurrent weights. Our results question the common, implicit assumption that learning in the brain should be expected to follow the negative Euclidean gradient of synaptic weights. more »

Award ID(s):: 1707400

PAR ID:: 10566303

Author(s) / Creator(s):: Zhu, Vicky; Rosenbaum, Robert

Publisher / Repository:: Neural Computation

Date Published:: 2024-07-19

Journal Name:: Neural Computation

Volume:: 36

Issue:: 8

ISSN:: 0899-7667

Page Range / eLocation ID:: 1568 to 1600

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1162/neco_a_01681

More Like this