Eigenvalue Normalized Recurrent Neural Networks for Short Term Memory

Helfrich, Kyle; Ye, Qiang

doi:10.1609/aaai.v34i04.5831

Citation Details

Eigenvalue Normalized Recurrent Neural Networks for Short Term Memory

Several variants of recurrent neural networks (RNNs) with orthogonal or unitary recurrent matrices have recently been developed to mitigate the vanishing/exploding gradient problem and to model long-term dependencies of sequences. However, with the eigenvalues of the recurrent matrix on the unit circle, the recurrent state retains all input information which may unnecessarily consume model capacity. In this paper, we address this issue by proposing an architecture that expands upon an orthogonal/unitary RNN with a state that is generated by a recurrent matrix with eigenvalues in the unit disc. Any input to this state dissipates in time and is replaced with new inputs, simulating short-term memory. A gradient descent algorithm is derived for learning such a recurrent matrix. The resulting method, called the Eigenvalue Normalized RNN (ENRNN), is shown to be highly competitive in several experiments. more »

Award ID(s):: 1821144 1620082

NSF-PAR ID:: 10167594

Author(s) / Creator(s):: Helfrich, Kyle; Ye, Qiang

Date Published:: 2020-06-16

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 34

Issue:: 04

ISSN:: 2159-5399

Page Range / eLocation ID:: 4115 to 4122

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1609/aaai.v34i04.5831

More Like this