Stochastic Computing Architectures for Lightweight LSTM Neural Networks

Sengupta, Roshwin; Polian, Ilia; Hayes, John P.

Citation Details

For emerging edge and near-sensor systems to perform hard classification tasks locally, they must avoid costly communication with the cloud. This requires the use of compact classifiers such as recurrent neural networks of the long short term memory (LSTM) type, as well as a low-area hardware technology such as stochastic computing (SC). We study the benefits and costs of applying SC to LSTM design. We consider a design space spanned by fully binary (non-stochastic), fully stochastic, and several hybrid (mixed) LSTM architectures, and design and simulate examples of each. Using standard classification benchmarks, we show that area and power can be reduced up to 47% and 86% respectively with little or no impact on classification accuracy. We demonstrate that fully stochastic LSTMs can deliver acceptable accuracy despite accumulated errors. Our results also suggest that ReLU is preferable to tanh as an activation function in stochastic LSTMs more »

Award ID(s):: 2006704

PAR ID:: 10324354

Author(s) / Creator(s):: Sengupta, Roshwin; Polian, Ilia; Hayes, John P.

Date Published:: 2022-04-01

Journal Name:: 2022 25th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this