Running sparse and low-precision neural network: When algorithm meets hardware

Li, Bing; Wen, Wei; Mao, Jiachen; Li, Sicheng; Chen, Yiran; Li, Hai Helen

doi:10.1109/ASPDAC.2018.8297378



Deep Neural Networks (DNNs) are pervasively applied in many artificial intelligence (AI) applications. The high performance of DNNs comes at the cost of larger model size and higher computational complexity. Recent studies show that DNNs contain considerable redundancy, such as zero-valued parameters and excessive numerical precision. To reduce computational complexity, many redundancy reduction techniques have been proposed, including pruning and data quantization. In this paper, we demonstrate our co-optimization of the DNN algorithm and hardware, which exploits the model redundancy to accelerate DNNs.

Award ID(s): 1717657 1725456

PAR ID: 10063492

Author(s) / Creator(s): Li, Bing; Wen, Wei; Mao, Jiachen; Li, Sicheng; Chen, Yiran; Li, Hai Helen

Date Published: 2018-01-01

Journal Name: Asia and South Pacific Design Automation Conference (ASP-DAC)

Page Range / eLocation ID: 534 to 539

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/ASPDAC.2018.8297378
