Title: Quantization-Based Optimization Algorithm for Hardware Implementation of Convolution Neural Networks
Convolutional neural networks (CNNs) have demonstrated remarkable performance in many areas but require significant computation and storage resources. Quantization is an effective method for reducing CNN complexity and simplifying implementation. The main research objective is to develop a scalable quantization algorithm for CNN hardware design and to model the performance metrics, for the purpose of implementing CNNs on resource-constrained devices (RCDs) and optimizing layers in deep neural networks (DNNs). The novelty of the algorithm lies in blending two quantization techniques to perform full-model quantization with optimal accuracy and without additional neurons. The algorithm is applied to a selected CNN model and implemented on an FPGA. Implementing the CNN with wide data word lengths is not feasible because of FPGA capacity limits. With the proposed quantization algorithm, we succeeded in implementing the model on the FPGA using 16-, 12-, and 8-bit quantization. Compared to the 16-bit design, the 8-bit design offers a 44% decrease in resource utilization and achieves power and energy reductions of 41% and 42%, respectively. Models show that trading off one quantization bit yields savings of approximately 5.4K LUTs, 4% logic utilization, 46.9 mW of power, and 147 μJ of energy. The models were also used to estimate performance metrics for a sample DNN design.
Award ID(s): 2142229
PAR ID: 10560326
Author(s) / Creator(s):
Publisher / Repository: MDPI
Date Published:
Journal Name: Electronics
Volume: 13
Issue: 9
ISSN: 2079-9292
Page Range / eLocation ID: 1727
Format(s): Medium: X
Sponsoring Org: National Science Foundation
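The abstract above reports a roughly per-bit cost model (about 5.4K LUTs, 4% logic utilization, 46.9 mW, and 147 μJ saved per quantization bit traded off). A minimal sketch of how such a model could be used to estimate savings between two word lengths follows; the linear extrapolation and the helper names are illustrative assumptions, not the paper's exact model.

```python
# Rough estimate of FPGA resource/power/energy savings from the per-bit
# figures reported in the abstract (approx. 5.4K LUTs, 4% logic utilization,
# 46.9 mW, and 147 uJ saved per quantization bit removed). The linear
# extrapolation between bit widths is an assumption for illustration only.

PER_BIT_SAVINGS = {
    "luts": 5_400,        # look-up tables
    "logic_pct": 4.0,     # percent of logic utilization
    "power_mw": 46.9,     # milliwatts
    "energy_uj": 147.0,   # microjoules per inference
}

def estimated_savings(from_bits: int, to_bits: int) -> dict:
    """Estimate savings when reducing the word length from `from_bits` to `to_bits`."""
    bits_traded = from_bits - to_bits
    return {metric: bits_traded * value for metric, value in PER_BIT_SAVINGS.items()}

if __name__ == "__main__":
    # e.g. moving from the 16-bit to the 8-bit design trades off 8 bits
    print(estimated_savings(16, 8))
```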
More Like this
1. Convolutional neural network (CNN)-based object detection has achieved very high accuracy; e.g., single-shot multi-box detectors (SSDs) can efficiently detect and localize various objects in an input image. However, they require a large amount of computation and memory storage, which makes efficient inference difficult on resource-constrained hardware devices such as drones or unmanned aerial vehicles (UAVs). Drone/UAV detection is an important task for applications including surveillance, defense, and multi-drone self-localization and formation control. In this article, we designed and co-optimized an algorithm and hardware for energy-efficient drone detection on resource-constrained FPGA devices. We trained an SSD object detection algorithm with a custom drone dataset. For inference, we employed low-precision quantization and adapted the width of the SSD CNN model. To improve throughput, we used dual-data-rate operations for the DSPs to effectively double the throughput with a limited DSP count. For different SSD algorithm models, we analyzed accuracy, or mean average precision (mAP), and evaluated the corresponding FPGA hardware utilization, DRAM communication, and throughput optimization. We evaluated the FPGA hardware on a custom drone dataset, Pascal VOC, and COCO2017. Our proposed design achieves a high mAP of 88.42% on the multi-drone dataset, with a high energy efficiency of 79 GOPS/W and a throughput of 158 GOPS, using the Xilinx Zynq ZU3EG FPGA device on the Open Vision Computer version 3 (OVC3) platform. Our design achieves 1.1 to 8.7× higher energy efficiency than prior works that used the same Pascal VOC dataset and the same FPGA device, at a low power consumption of 2.54 W. For the COCO dataset, our MobileNet-V1 implementation achieved an mAP of 16.8 and an energy efficiency of 4.9 FPS/W, which is ∼1.9× higher than prior FPGA works and other commercial hardware platforms.
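The low-precision quantization step mentioned above can be illustrated with a generic symmetric uniform quantizer; the sketch below is an assumption for illustration and may differ from the quantization scheme actually used in that work.

```python
import numpy as np

def quantize_symmetric(w: np.ndarray, num_bits: int = 8):
    """Symmetric uniform quantization of a weight tensor to signed `num_bits` integers.
    A generic sketch; the paper's exact quantization scheme may differ."""
    qmax = 2 ** (num_bits - 1) - 1
    max_abs = float(np.max(np.abs(w)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int32)
    return q, scale  # dequantize with q * scale

# Example: quantize a random 3x3 conv kernel stack to 8 bits
w = np.random.randn(3, 3, 64, 128).astype(np.float32)
q, scale = quantize_symmetric(w, num_bits=8)
w_hat = q * scale
print("max abs error:", np.max(np.abs(w - w_hat)))
```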
2. Deep Neural Networks (DNNs) have been successfully applied in many fields. Considering performance, flexibility, and energy efficiency, Field-Programmable Gate Array (FPGA)-based accelerators for DNNs are a promising solution. Existing frameworks, however, lack reusability and are not friendly for designing a new network with minimal effort. Modern high-level synthesis (HLS) tools greatly reduce the turnaround time of designing and implementing complex FPGA-based accelerators. This paper presents a framework for hardware acceleration of DNNs from a high-level specification. A novel architecture is introduced that maximizes data reuse and external memory bandwidth utilization. The framework generates scalable HLS code for a given pre-trained model that can be mapped to different FPGA platforms. Various HLS compiler optimizations have been applied to the code to produce an efficient implementation with high resource utilization. The framework achieves a peak performance of 23 frames per second for SqueezeNet on a Xilinx Alveo U250 board.
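To illustrate why a data-reuse-oriented architecture matters, the toy model below counts external-memory words fetched with and without input-tile reuse; the tile size, buffering scheme, and function names are hypothetical and not taken from the framework described above.

```python
import math

# Back-of-the-envelope model of why tiling/input reuse cuts external memory
# traffic in a CNN accelerator. Halo overlap for the k x k window is ignored
# for simplicity; all numbers are illustrative.

def dram_words_no_reuse(h, w, c_in, c_out, k):
    # every output channel re-reads the whole input feature map from DRAM
    return h * w * c_in * c_out

def dram_words_with_input_reuse(h, w, c_in, c_out, k, tile=16):
    # each input tile is fetched once into on-chip buffers and reused for all
    # output channels computed from it (weights assumed to stay on chip)
    tiles = math.ceil(h / tile) * math.ceil(w / tile)
    return tiles * (tile * tile) * c_in

if __name__ == "__main__":
    args = dict(h=56, w=56, c_in=64, c_out=128, k=3)
    print("no reuse  :", dram_words_no_reuse(**args))
    print("with reuse:", dram_words_with_input_reuse(**args))
```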
3. Ever-growing edge applications often require short processing latency and high energy efficiency to meet strict timing and power budgets. In this work, we propose that the compact long short-term memory (LSTM) model can approximate conventional acausal algorithms with reduced latency and improved efficiency for real-time causal prediction, especially for neural signal processing in closed-loop feedback applications. We design an LSTM inference accelerator by taking advantage of fine-grained parallelism and pipelined feedforward and recurrent updates. We also propose a bit-sparse quantization method that can reduce circuit area and power consumption by replacing multipliers with bit-shift operators. We explore different combinations of pruning and quantization methods for energy-efficient LSTM inference on datasets collected from electroencephalogram (EEG) and calcium imaging applications. Evaluation results show that our proposed LSTM inference accelerator can achieve 1.19 GOPS/mW energy efficiency. The LSTM accelerator with 2-sbit/16-bit sparse quantization and 60% sparsity can reduce circuit area and power consumption by 54.1% and 56.3%, respectively, compared with a 16-bit baseline implementation.
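The bit-sparse quantization idea above (replacing multipliers with bit shifts) can be sketched by approximating each weight as a sum of a few signed powers of two; the greedy encoding below is illustrative and may differ from the paper's 2-sbit/16-bit format.

```python
import numpy as np

def to_signed_powers_of_two(w: float, num_terms: int = 2, max_shift: int = 15):
    """Greedily approximate a weight as a sum of at most `num_terms` signed powers
    of two, so w*x can be computed with shifts and adds instead of a multiplier.
    Illustrative sketch of bit-sparse quantization only."""
    terms, residual = [], w
    for _ in range(num_terms):
        if residual == 0:
            break
        sign = 1 if residual > 0 else -1
        shift = int(np.clip(np.round(np.log2(abs(residual))), -max_shift, max_shift))
        terms.append((sign, shift))
        residual -= sign * (2.0 ** shift)
    return terms

def shift_multiply(x: int, terms):
    """Multiply an integer activation x by a bit-sparse weight using shifts only."""
    acc = 0
    for sign, shift in terms:
        acc += sign * (x << shift if shift >= 0 else x >> -shift)
    return acc

terms = to_signed_powers_of_two(0.74)      # two signed power-of-two terms, 2^0 - 2^-2 = 0.75
print(terms, shift_multiply(1024, terms))  # roughly 0.74 * 1024
```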
4. The ever-increasing number of layers, millions of parameters, and large data volumes make deep learning workloads resource-intensive and power-hungry. In this paper, we develop a convolutional neural network (CNN) acceleration framework, named MLCNN, which explores algorithm-hardware co-design to achieve cross-layer cooperative optimization and acceleration. MLCNN dramatically reduces computation and on-/off-chip communication, improving CNN performance. To achieve this, MLCNN reorders the positions of the nonlinear activation and pooling layers, which we prove results in negligible accuracy loss; the convolutional and pooling layers are then co-optimized by means of redundant-multiplication elimination, local addition reuse, and global addition reuse. To the best of our knowledge, MLCNN is the first of its kind to incorporate cooperative optimization across the convolutional, activation, and pooling layers. We further customize the MLCNN accelerator to take full advantage of cross-layer CNN optimization to reduce both computation and on-/off-chip communication. Our analysis shows that MLCNN can significantly reduce (by up to 98%) multiplications and additions. We have implemented a prototype of MLCNN and evaluated its performance on several widely used CNN models using both an accelerator-level cycle and energy model and an RTL implementation. Experimental results show that MLCNN achieves a 3.2× speedup and 2.9× higher energy efficiency compared with dense CNNs. MLCNN's optimization methods are orthogonal to other CNN acceleration techniques, such as quantization and pruning. Combined with quantization, our quantized MLCNN gains a 12.8× speedup and 11.3× higher energy efficiency compared with the dense CNN.
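For the common case of ReLU followed by max pooling, swapping the two layers is exact because both operations are monotone, which is one way to see why such reordering costs little accuracy; the small check below illustrates this on a toy tensor and is not the paper's general proof.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def maxpool2x2(x):
    h, w = x.shape
    return x[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

x = np.random.randn(8, 8)

# ReLU and max pooling are both monotone, so swapping their order gives
# identical outputs, while pooling first evaluates ReLU on 4x fewer elements.
assert np.allclose(maxpool2x2(relu(x)), relu(maxpool2x2(x)))
print("pool->ReLU equals ReLU->pool on this example")
```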
5. As the machine learning and systems communities strive to achieve higher energy efficiency through custom deep neural network (DNN) accelerators, varied precision or quantization levels, and model compression techniques, there is a need for design space exploration frameworks that incorporate quantization-aware processing elements into the accelerator design space while providing accurate and fast power, performance, and area models. In this work, we present QUIDAM, a highly parameterized quantization-aware DNN accelerator and model co-exploration framework. Our framework facilitates future research on design space exploration of DNN accelerators for various design choices such as bit precision, processing element type, scratchpad sizes of processing elements, global buffer size, total number of processing elements, and DNN configurations. Our results show that different bit precisions and processing element types lead to significant differences in performance per area and energy. Specifically, our framework identifies a wide range of design points where performance per area and energy vary by more than 5× and 35×, respectively. With the proposed framework, we show that lightweight processing elements achieve on-par accuracy and up to 5.7× improvement in performance per area and energy compared to the best INT16-based implementation. Finally, thanks to the efficiency of the pre-characterized power, performance, and area models, QUIDAM can speed up the design exploration process by 3-4 orders of magnitude, as it removes the need for expensive synthesis and characterization of each design.
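A toy sweep in the spirit of such a co-exploration framework is sketched below; the processing-element types, cost numbers, and scoring are hypothetical placeholders rather than QUIDAM's pre-characterized models.

```python
from itertools import product

# Toy design-space sweep over processing element (PE) type and PE count.
# Relative area and energy-per-MAC figures are invented for illustration.

PE_TYPES = {
    "int8_mac":  (1.0, 1.0),   # (relative area, relative energy per MAC)
    "int16_mac": (2.1, 2.4),
    "light_pe":  (0.4, 0.5),
}

def score(pe_type, num_pes, ops_per_cycle_per_pe=2):
    area, energy = PE_TYPES[pe_type]
    throughput = num_pes * ops_per_cycle_per_pe  # ops per cycle (toy model)
    return {
        "perf_per_area":   throughput / (area * num_pes),
        "perf_per_energy": throughput / (energy * num_pes),
    }

# Enumerate design points and print the performance-per-area/energy trade-offs
for pe_type, num_pes in product(PE_TYPES, [64, 256, 1024]):
    print(pe_type, num_pes, score(pe_type, num_pes))
```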