Tolerating Defects in Low-Power Neural Network Accelerators Via Retraining-Free Weight Approximation

Hosseini, Fateme S.; Meng, Fanruo; Yang, Chengmo; Wen, Wujie; Cammarota, Rosario

doi:10.1145/3477016

Citation Details

Tolerating Defects in Low-Power Neural Network Accelerators Via Retraining-Free Weight Approximation

Hardware accelerators are essential to the accommodation of ever-increasing Deep Neural Network (DNN) workloads on the resource-constrained embedded devices. While accelerators facilitate fast and energy-efficient DNN operations, their accuracy is threatened by faults in their on-chip and off-chip memories, where millions of DNN weights are held. The use of emerging Non-Volatile Memories (NVM) further exposes DNN accelerators to a non-negligible rate of permanent defects due to immature fabrication, limited endurance, and aging. To tolerate defects in NVM-based DNN accelerators, previous work either requires extra redundancy in hardware or performs defect-aware retraining, imposing significant overhead. In comparison, this paper proposes a set of algorithms that exploit the flexibility in setting the fault-free bits in weight memory to effectively approximate weight values, so as to mitigate defect-induced accuracy drop. These algorithms can be applied as a one-step solution when loading the weights to embedded devices. They only require trivial hardware support and impose negligible run-time overhead. Experiments on popular DNN models show that the proposed techniques successfully boost inference accuracy even in the face of elevated defect rates in the weight memory. more »

Award ID(s):: 2011236 1909854

PAR ID:: 10296238

Author(s) / Creator(s):: Hosseini, Fateme S.; Meng, Fanruo; Yang, Chengmo; Wen, Wujie; Cammarota, Rosario

Date Published:: 2021-10-31

Journal Name:: ACM Transactions on Embedded Computing Systems

Volume:: 20

Issue:: 5s

ISSN:: 1539-9087

Page Range / eLocation ID:: 1 to 21

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3477016

More Like this