How to Obtain and Run Light and Efficient Deep Learning Networks

Chen, Fan; Wen, Wei; Song, Linghao; Zhang, Jingchi; Li, Hai Helen; Chen, Yiran

doi:10.1109/ICCAD45719.2019.8942106

Citation Details

How to Obtain and Run Light and Efficient Deep Learning Networks

As the model size of deep neural networks (DNNs) grows for better performance, the increase in computational cost associated with training and testing makes it extremely difficulty to deploy DNNs on end/edge devices with limited resources while also satisfying the response time requirement. To address this challenge, model compression which compresses model size and thus reduces computation cost is widely adopted in deep learning society. However, the practical impacts of hardware design are often ignored in these algorithm-level solutions, such as the increase of the random accesses to memory hierarchy and the constraints of memory capacity. On the other side, limited understanding about the computational needs at algorithm level may lead to unrealistic assumptions during the hardware designs. In this work, we will discuss this mismatch and provide how our approach addresses it through an interactive design practice across both software and hardware levels. more »

Award ID(s):: 1822085 1910299 1717657

PAR ID:: 10179377

Author(s) / Creator(s):: Chen, Fan; Wen, Wei; Song, Linghao; Zhang, Jingchi; Li, Hai Helen; Chen, Yiran

Date Published:: 2019-11-01

Journal Name:: The 2019 International Conference on Computer-Aided Design (ICCAD)

Page Range / eLocation ID:: 1 to 5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICCAD45719.2019.8942106

More Like this