CANNON: Communication-Aware Sparse Neural Network Optimization

Goksoy, A. Alper; Li, Guihong; Mandal, Sumit K.; Ogras, Umit Y.; Marculescu, Radu

doi:10.1109/TETC.2023.3289778

Citation Details

Sparse deep neural networks (DNNs) have the potential to deliver compelling performance and energy efficiency without significant accuracy loss. However, their benefits can quickly diminish if their training is oblivious to the target hardware. For example, fewer critical connections can have a significant overhead if they translate into long-distance communication on the target hardware. Therefore, hardware-aware sparse training is needed to leverage the full potential of sparse DNNs. To this end, we propose a novel and comprehensive communication-aware sparse DNN optimization framework for tile-based in-memory computing (IMC) architectures. The proposed technique, CANNON, first maps the DNN layers onto the tiles of the target architecture. Then, it replaces the fully connected and convolutional layers with communication-aware sparse connections. After that, CANNON optimizes the communication cost with minimal impact on the DNN accuracy. Extensive experimental evaluations with a wide range of DNNs and datasets show up to 3.0× lower communication energy, 3.1× lower communication latency, and 6.8× lower energy-delay product compared to state-of-the-art pruning approaches with a negligible impact on the classification accuracy on IMC-based machine learning accelerators.
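
The abstract's point that fewer connections can still cost more when they span long distances can be illustrated with a small sketch. This is not the CANNON algorithm or its cost model; it assumes a hypothetical 2D mesh of tiles numbered row-major, Manhattan hop count as a stand-in for communication cost, and hand-picked neuron-to-tile mappings. All function and variable names are illustrative.

```python
import numpy as np

def tile_coords(tile_id, mesh_width):
    """(row, column) of a tile on a 2D mesh numbered row-major (assumed layout)."""
    return divmod(tile_id, mesh_width)

def hop_distance(src_tile, dst_tile, mesh_width):
    """Manhattan hop count between two tiles (assumed proxy for link cost)."""
    sr, sc = tile_coords(src_tile, mesh_width)
    dr, dc = tile_coords(dst_tile, mesh_width)
    return abs(sr - dr) + abs(sc - dc)

def communication_cost(mask, src_tiles, dst_tiles, mesh_width):
    """Sum of hop distances over all kept (nonzero) connections.

    mask[i, j] == 1 means input neuron j feeds output neuron i.
    src_tiles[j] and dst_tiles[i] give the tile each neuron is mapped to.
    """
    rows, cols = np.nonzero(mask)
    return sum(
        hop_distance(src_tiles[j], dst_tiles[i], mesh_width)
        for i, j in zip(rows, cols)
    )

# Hypothetical 4x4 mesh: four input neurons and two output neurons.
mesh_width = 4
src_tiles = [0, 1, 14, 15]   # tile holding each input neuron
dst_tiles = [0, 15]          # tile holding each output neuron

# Four connections, all between nearby tiles.
mask_near = np.array([[1, 1, 0, 0],
                      [0, 0, 1, 1]])
# Only three connections, but each one crosses the mesh.
mask_far = np.array([[0, 0, 1, 1],
                     [1, 0, 0, 0]])

cost_near = communication_cost(mask_near, src_tiles, dst_tiles, mesh_width)
cost_far = communication_cost(mask_far, src_tiles, dst_tiles, mesh_width)
print(cost_near, cost_far)  # 2 17
```

Under these assumptions the sparser mask is the more expensive one: three cross-mesh connections cost 17 hops, while four local connections cost 2.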

Award ID(s): 2007284

PAR ID: 10468127

Author(s) / Creator(s): Goksoy, A. Alper; Li, Guihong; Mandal, Sumit K.; Ogras, Umit Y.; Marculescu, Radu

Publisher / Repository: IEEE

Date Published: 2023-01-01

Journal Name: IEEE Transactions on Emerging Topics in Computing

ISSN: 2376-4562

Page Range / eLocation ID: 1 to 13

Subject(s) / Keyword(s): Hardware-aware pruning, communication-aware pruning, mapping, sparse neural networks

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1109/TETC.2023.3289778
