NESTA: Hamming Weight Compression-Based Neural Proc. EngineAli Mirzaeian

Mirzaeian, Ali; Homayoun, Houman; Sasan, Avesta

doi:10.1109/ASP-DAC47756.2020.9045135

Citation Details

NESTA: Hamming Weight Compression-Based Neural Proc. EngineAli Mirzaeian

In this paper, we present NESTA, a specialized Neural engine that significantly accelerates the computation of convolution layers in a deep convolutional neural network, while reducing the computational energy. NESTA reformats Convolutions into 3 × 3 batches and uses a hierarchy of Hamming Weight Compressors to process each batch. Besides, when processing the convolution across multiple channels, NESTA, rather than computing the precise result of a convolution per channel, quickly computes an approximation of its partial sum, and a residual value such that if added to the approximate partial sum, generates the accurate output. Then, instead of immediately adding the residual, it uses (consumes) the residual when processing the next batch in the hamming weight compressors with available capacity. This mechanism shortens the critical path by avoiding the need to propagate carry signals during each round of computation and speeds up the convolution of each channel. In the last stage of computation, when the partial sum of the last channel is computed, NESTA terminates by adding the residual bits to the approximate output to generate a correct result. more »

Award ID(s):: 1718538 2146726

PAR ID:: 10182706

Author(s) / Creator(s):: Mirzaeian, Ali; Homayoun, Houman; Sasan, Avesta

Date Published:: 2020-01-01

Journal Name:: ASP-DAC

Page Range / eLocation ID:: 530 to 537

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ASP-DAC47756.2020.9045135

More Like this