Title: An ASIC Accelerator for QNN With Variable Precision and Tunable Energy Efficiency
This article presents TULIP, a new architecture for variable-precision quantized neural network (QNN) inference, designed with the goal of maximizing energy efficiency per classification. TULIP is constructed by arranging a collection of unique processing elements (TULIP-PEs) in a single-instruction-multiple-data (SIMD) fashion. Each TULIP-PE contains binary neurons that are interconnected using multiplexers, and each neuron has a small dedicated local register. The binary neurons are implemented as standard cells and are used to evaluate threshold functions, i.e., an inner product followed by a thresholding operation on binary inputs. The neurons can be reconfigured with a single change in the control signals to implement all the standard operations used in a QNN. The article presents novel algorithms for implementing the operations of a QNN on the TULIP-PEs as schedules of threshold functions. TULIP was implemented as an ASIC in TSMC 40-nm LP technology, and a QNN accelerator employing a conventional multiply-and-accumulate arithmetic processor was implemented in the same technology to provide a fair comparison. The results show that TULIP is 30x-50x more energy efficient than the equivalent design, without any penalty in performance, area, or accuracy. Furthermore, TULIP achieves these improvements without using traditional techniques such as voltage scaling or approximate computing. Finally, the article demonstrates how the run-time tradeoff between accuracy and energy efficiency can be exercised on the TULIP architecture.
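The threshold function at the heart of each TULIP-PE (an inner product over binary inputs followed by a comparison against a threshold) can be sketched in a few lines of Python. This is an illustrative functional model only; the function name and example values below are not taken from the article.

    # Illustrative sketch (not from the article): a binary neuron evaluating a
    # threshold function, i.e., an inner product over binary inputs followed by
    # a comparison against a threshold.
    def threshold_neuron(inputs, weights, threshold):
        """Return 1 if the weighted sum of binary inputs meets the threshold."""
        assert all(x in (0, 1) for x in inputs)
        acc = sum(w * x for w, x in zip(weights, inputs))  # inner product
        return 1 if acc >= threshold else 0

    # Example: a 3-input majority gate expressed as a threshold function.
    print(threshold_neuron([1, 0, 1], [1, 1, 1], threshold=2))  # -> 1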
Award ID(s):
2008244
PAR ID:
10521393
Publisher / Repository:
IEEE
Date Published:
Journal Name:
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Volume:
43
Issue:
7
ISSN:
0278-0070
Page Range / eLocation ID:
2057 to 2070
Subject(s) / Keyword(s):
Neurons; Energy efficiency; Computer architecture; Artificial neural networks; Training; Field programmable gate arrays; Throughput
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    Ultra-low-power (ULP) devices are becoming pervasive, enabling many emerging sensing applications. Energy efficiency is paramount in these applications, as efficiency determines device lifetime in battery-powered deployments and performance in energy-harvesting deployments. Unfortunately, existing designs fall short because ASICs' upfront costs are too high and prior ULP architectures are too inefficient or inflexible. We present Snafu, the first framework to flexibly generate ULP coarse-grain reconfigurable arrays (CGRAs). Snafu provides a standard interface for processing elements (PEs), making it easy to integrate new types of PEs for new applications. Unlike prior high-performance, high-power CGRAs, Snafu is designed from the ground up to minimize energy consumption while maximizing flexibility. Snafu saves energy by configuring PEs and routers for a single operation to minimize switching activity; by minimizing buffering within the fabric; by implementing a statically routed, bufferless, multi-hop network; and by executing operations in order to avoid expensive tag-token matching. We further present Snafu-Arch, a complete ULP system that integrates an instantiation of the Snafu fabric alongside a scalar RISC-V core and memory. We implement Snafu in RTL and evaluate it on an industrial sub-28 nm FinFET process across a suite of common sensing benchmarks. Snafu-Arch operates at <1 mW, orders of magnitude less power than most prior CGRAs. Snafu-Arch uses 41% less energy and runs 4.4× faster than the prior state-of-the-art general-purpose ULP architecture. Moreover, we conduct three comprehensive case studies to quantify the cost of programmability in Snafu. We find that Snafu-Arch is close to ASIC designs built in the same technology, using just 2.6× more energy on average.
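    As a rough illustration of the single-operation-per-configuration idea, the toy Python model below configures each PE once and then only streams data through it, with no per-cycle instruction fetch or tag-token matching. The names and structure are invented here for illustration and are not taken from Snafu.

        # Toy model (names invented here): each PE is configured once for a single
        # operation, so execution is just data flowing through fixed-function
        # elements along a statically routed path.
        def make_pe(op):
            """Configure a PE for one operation; afterwards it only applies that operation."""
            return lambda a, b: op(a, b)

        mul_pe = make_pe(lambda a, b: a * b)
        add_pe = make_pe(lambda a, b: a + b)

        # A two-PE pipeline computing a*b + c for a stream of inputs.
        for a, b, c in [(1, 2, 3), (4, 5, 6)]:
            print(add_pe(mul_pe(a, b), c))  # -> 5, then 26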
  2. Given a sample of length n from R^d and a neural network with a fixed architecture with W weights, k neurons, linear threshold activation functions, and binary outputs on each neuron, we study the problem of uniformly sampling from all possible labelings of the sample corresponding to different choices of weights. We provide an algorithm that runs in time polynomial in both n and W such that any labeling appears with probability at least (W/(2ekn))^W for W < n. For a single neuron, we also provide a random-walk-based algorithm that samples exactly uniformly.
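    To make the notion of a labeling concrete, the short sketch below evaluates the labeling that one linear threshold neuron induces on a sample: the tuple of binary outputs obtained by thresholding w·x at zero over the n sample points. This only illustrates the object being sampled; it is not the sampling algorithm from the paper.

        # Illustrative only: the labeling induced on a sample by a single linear
        # threshold neuron is the tuple of binary outputs over the n points.
        import random

        def labeling(sample, w):
            return tuple(1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0 else 0
                         for x in sample)

        sample = [[random.gauss(0, 1) for _ in range(3)] for _ in range(5)]  # n = 5, d = 3
        w = [random.gauss(0, 1) for _ in range(3)]
        print(labeling(sample, w))  # e.g., (1, 0, 1, 1, 0)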
  3. An emerging use case of machine learning (ML) is to train a model on a high-performance system and deploy the trained model on energy-constrained embedded systems. Neuromorphic hardware platforms, which operate on principles of the biological brain, can significantly lower the energy overhead of a machine-learning inference task, making these platforms an attractive solution for embedded ML systems. We present a design-technology tradeoff analysis for implementing such inference tasks on the processing elements (PEs) of Non-Volatile Memory (NVM)-based neuromorphic hardware. Through detailed circuit-level simulations at scaled process technology nodes, we show the negative impact of technology scaling on information-processing latency, which impacts the quality-of-service (QoS) of an embedded ML system. At a finer granularity, the latency inside a PE depends on 1) the delay introduced by parasitic components on its current paths, and 2) the varying delay to sense different resistance states of its NVM cells. Based on these two observations, we make the following three contributions. First, on the technology front, we propose an optimization scheme in which the NVM resistance state that takes the longest time to sense is set on the current paths having the least delay, and vice versa, reducing the average PE latency and thereby improving QoS. Second, on the architecture front, we introduce isolation transistors within each PE to partition it into regions that can be individually power-gated, reducing both latency and energy. Finally, on the system-software front, we propose a mechanism to leverage the proposed technological and architectural enhancements when implementing a machine-learning inference task on the neuromorphic PEs of the hardware. Evaluations with a recent neuromorphic hardware architecture show that our proposed design-technology co-optimization approach improves both the performance and energy efficiency of machine-learning inference tasks without incurring a high cost per bit.
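    The first (technology-level) contribution pairs the slowest-to-sense resistance states with the fastest current paths. A minimal sketch of that assignment is below; the state names and delay values are invented for illustration.

        # Minimal sketch (values invented): place the resistance state that takes the
        # longest to sense on the current path with the least parasitic delay, and
        # vice versa, reducing the average sensing latency of a PE.
        sense_times = {"HRS": 9.0, "IRS1": 6.0, "IRS2": 4.0, "LRS": 2.0}        # ns
        path_delays = {"path0": 0.5, "path1": 1.0, "path2": 1.5, "path3": 2.0}  # ns

        states_slow_first = sorted(sense_times, key=sense_times.get, reverse=True)
        paths_fast_first = sorted(path_delays, key=path_delays.get)

        for state, path in zip(states_slow_first, paths_fast_first):
            print(f"{state} -> {path}: {sense_times[state] + path_delays[path]:.1f} ns")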
  4. This article presents C3SRAM, an in-memory-computing SRAM macro. The macro is an SRAM module with circuits embedded in the bitcells and peripherals to perform hardware acceleration for neural networks with binarized weights and activations. The macro utilizes analog-mixed-signal (AMS) capacitive-coupling computing to evaluate the main computations of binary neural networks, binary multiply-and-accumulate operations. Without the need to access the stored weights row by row, the macro asserts all its rows simultaneously and forms an analog voltage at the read-bitline node through capacitive voltage division. With one analog-to-digital converter (ADC) per column, the macro realizes fully parallel vector-matrix multiplication in a single cycle. The network type that the macro supports and the computing mechanism it utilizes are determined by the robustness and error tolerance necessary in AMS computing. The C3SRAM macro is prototyped in 65-nm CMOS. It demonstrates an energy efficiency of 672 TOPS/W and a speed of 1638 GOPS (20.2 TOPS/mm²), achieving a 3975× better energy-delay product than the conventional digital baseline performing the same operation. The macro achieves 98.3% accuracy on MNIST and 85.5% on CIFAR-10, which is among the best of in-memory-computing works in terms of the energy-efficiency and inference-accuracy tradeoff.
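    In the digital view, the binary multiply-and-accumulate that the macro evaluates in the analog domain reduces to an XNOR followed by a popcount. The sketch below mirrors only that arithmetic, not the capacitive-coupling circuit.

        # Functional sketch of a binary multiply-and-accumulate: with weights and
        # activations in {-1, +1} encoded as 0/1 bits, each product is an XNOR and
        # the accumulation is a popcount mapped back to a signed dot product.
        def binary_mac(w_bits, a_bits):
            matches = sum(1 for w, a in zip(w_bits, a_bits) if w == a)  # XNOR + popcount
            return 2 * matches - len(w_bits)

        print(binary_mac([1, 0, 1, 1], [1, 1, 1, 0]))  # -> 0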
  5. Mathew, Sanu (Ed.)
    This article presents a 32-bit floating-point (FP32) programmable accelerator for solving a wide range of partial differential equations (PDEs) based on numerical integration methods. Compared to prior works that use fixed-point arithmetic and are only applicable to specific types of PDEs, our proposed integration accelerator for PDEs, named INTIACC, consists of 16 locally interconnected processing elements (PEs), where each PE is a fully programmable reduced instruction set computer (RISC) processor with an FP32 arithmetic logic unit (ALU) and a custom-designed instruction set architecture (ISA). These features enable INTIACC to generate solutions with high precision and a wide dynamic range, and they also allow users to implement different numerical algorithms, perform high-order integration methods, and evaluate nonlinear functions. In addition, we create a novel slow-global-fast-local clocking scheme in which the PEs operate asynchronously with each other most of the time. We prototype the INTIACC test chip in 65 nm with a core area of 0.975 mm². Running at an average local clock frequency of 570 MHz at 1 V, it offers a single-precision computation throughput of 9.12 GFLOPS. Testing results show that, with a similar energy-delay product, INTIACC is up to 40× faster than the prior state-of-the-art PDE solver.
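    As an example of the kind of workload such an accelerator targets, the sketch below applies explicit (forward Euler) time integration to a 1-D heat equation discretized on a small grid, using FP32 arithmetic throughout. The scheme and parameters are chosen here for illustration and are not taken from the article.

        # Illustrative forward-Euler integration of a 1-D heat equation (FP32).
        import numpy as np

        nx, dx, dt, alpha, steps = 16, 1.0, 0.2, 1.0, 50
        u = np.zeros(nx, dtype=np.float32)  # temperature field in FP32
        u[nx // 2] = 1.0                    # initial heat pulse in the middle

        for _ in range(steps):
            lap = np.zeros_like(u)
            lap[1:-1] = u[2:] - 2 * u[1:-1] + u[:-2]      # second spatial difference
            u = u + np.float32(alpha * dt / dx**2) * lap  # one Euler integration step

        print(u.round(3))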