Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework

Yuan, Geng; Dong, Peiyan; Sun, Mengshu; Niu, Wei; Li, Zhengang; Cai, Yuxuan; Li, Yanyu; Liu, Jun; Jiang, Weiwen; Lin, Xue; Ren, Bin; Tang, Xulong; Wang, Yanzhi

doi:10.1145/3528578

Citation Details

Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework

Efficient deployment of Deep Neural Networks (DNNs) on edge devices (i.e., FPGAs and mobile platforms) is very challenging, especially under a recent witness of the increasing DNN model size and complexity. Model compression strategies, including weight quantization and pruning, are widely recognized as effective approaches to significantly reduce computation and memory intensities, and have been implemented in many DNNs on edge devices. However, most state-of-the-art works focus on ad-hoc optimizations, and there lacks a thorough study to comprehensively reveal the potentials and constraints of different edge devices when considering different compression strategies. In this paper, we qualitatively and quantitatively compare the energy efficiency of FPGA-based and mobile-based DNN executions using mobile GPU and provide a detailed analysis. Based on the observations obtained from the analysis, we propose a unified optimization framework using block-based pruning to reduce the weight storage and accelerate the inference speed on mobile devices and FPGAs, achieving high hardware performance and energy-efficiency gain while maintaining accuracy. more »

Award ID(s):: 2047516

PAR ID:: 10357922

Author(s) / Creator(s):: Yuan, Geng; Dong, Peiyan; Sun, Mengshu; Niu, Wei; Li, Zhengang; Cai, Yuxuan; Li, Yanyu; Liu, Jun; Jiang, Weiwen; Lin, Xue; Ren, Bin; Tang, Xulong; Wang, Yanzhi

Date Published:: 2022-01-01

Journal Name:: ACM Transactions on Embedded Computing Systems

ISSN:: 1539-9087

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3528578

More Like this