Soft Error Resilient Deep Learning Systems Using Neuron Gradient Statistics

Amarnath, Chandramouli; Mejri, Mohamed; Ma, Kwondo; Chatterjee, Abhijit

doi:10.1109/IOLTS56730.2022.9897815

Deep learning techniques have been widely adopted in daily life, with applications ranging from face recognition to recommender systems. The substantial overhead of conventional error tolerance techniques precludes their widespread use, while approaches involving median filtering and invariant generation rely on alterations to DNN training that may be difficult to achieve for larger networks on larger datasets. To address this issue, this paper presents a novel approach that uses the statistics of neuron output gradients to identify and suppress erroneous neuron values. By using the statistics of neurons’ gradients with respect to their neighbors, tighter statistical thresholds are obtained than with neuron output values alone. This approach is modular and is combined with accurate, low-overhead error detection methods to ensure it is used only when needed, further reducing its cost. Deep learning models can be trained using standard methods, and our error correction module is then fit to the trained DNN. It achieves comparable or superior performance relative to baseline error correction methods at comparable hardware overhead, without modifying DNN training or requiring specialized hardware architectures.
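
The abstract gives no implementation details, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes that a neuron's "gradient with respect to its neighbors" can be approximated by the finite difference between adjacent neuron outputs in a layer, that thresholds are set at the mean plus or minus `k` standard deviations of those differences over fault-free calibration data, and that a flagged neuron is suppressed by zeroing it. The function names, the value of `k`, and the zeroing policy are all hypothetical, and the error detection step mentioned in the abstract is omitted.

```python
import numpy as np

def neighbor_gradients(acts):
    """Finite differences between adjacent neuron outputs (last axis)."""
    return np.diff(acts, axis=-1)

def fit_thresholds(clean_acts, k=4.0):
    """Per-position bounds mean +/- k*std, fit on fault-free activations.

    clean_acts has shape (num_samples, num_neurons). k is an assumed value.
    """
    g = neighbor_gradients(clean_acts)
    mu, sigma = g.mean(axis=0), g.std(axis=0)
    return mu - k * sigma, mu + k * sigma

def suppress_errors(acts, lo, hi):
    """Zero each neuron whose differences to all of its neighbors are out of bounds."""
    g = neighbor_gradients(acts)
    bad = (g < lo) | (g > hi)  # bad[..., j] links neuron j and neuron j + 1
    pad = np.ones(bad.shape[:-1] + (1,), dtype=bool)
    left_bad = np.concatenate([pad, bad], axis=-1)   # edge neurons have no left neighbor
    right_bad = np.concatenate([bad, pad], axis=-1)  # edge neurons have no right neighbor
    flagged = left_bad & right_bad
    return np.where(flagged, 0.0, acts), flagged

# Calibrate on synthetic "fault-free" activations (stand-in for a real layer).
rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, size=(1000, 8))
lo, hi = fit_thresholds(clean)

# Emulate a soft error: one neuron output becomes implausibly large.
x = np.linspace(-0.5, 0.5, 8)
faulty = x.copy()
faulty[3] = 1e4
repaired, flagged = suppress_errors(faulty, lo, hi)
```

In this toy setup only the corrupted neuron is flagged, because both of its neighbor differences fall outside the calibrated bounds while each adjacent neuron still has one in-bounds difference. A real deployment would calibrate on activations of the trained DNN and would run the correction only when a separate detector signals an error, as the abstract describes.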

Award ID(s): 2128419

PAR ID: 10453090

Author(s) / Creator(s): Amarnath, Chandramouli; Mejri, Mohamed; Ma, Kwondo; Chatterjee, Abhijit

Date Published: 2022-09-12

Journal Name: International On-Line Testing Symposium

Page Range / eLocation ID: 1 to 6

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/IOLTS56730.2022.9897815
