TEFLON: Thermally Efficient Dataflow-aware 3D NoC for Accelerating CNN Inferencing on Manycore PIM Architectures

Narang, Gaurav  (ORCID:0000000195171280); Ogbogu, Chukwufumnanya  (ORCID:0000000281701161); Doppa, Janardhan_Rao  (ORCID:0000000238485301); Pande, Partha_Pratim  (ORCID:0000000259308531)

doi:10.1145/3665279

Citation Details

TEFLON: Thermally Efficient Dataflow-aware 3D NoC for Accelerating CNN Inferencing on Manycore PIM Architectures

Resistive random-access memory (ReRAM)-based processing-in-memory (PIM) architectures are used extensively to accelerate inferencing/training with convolutional neural networks (CNNs). Three-dimensional (3D) integration is an enabling technology to integrate many PIM cores on a single chip. In this work, we propose the design of athermallyefficient dataflow-aware monolithic 3D (M3D)NoC architecture referred to asTEFLONto accelerate CNN inferencing without creating any thermal bottlenecks.TEFLONreduces the Energy-Delay-Product (EDP) by 42%, 46%, and 45% on an average compared to a conventional 3D mesh NoC for systems with 36-, 64-, and 100-PIM cores, respectively.TEFLONreduces the peak chip temperature by 25Kand improves the inference accuracy by up to 11% compared to sole performance-optimized SFC-based counterpart for inferencing with diverse deep CNN models using CIFAR-10/100 datasets on a 3D system with 100-PIM cores. more »

Award ID(s):: 1955353 2308530

PAR ID:: 10606862

Author(s) / Creator(s):: Narang, Gaurav ; Ogbogu, Chukwufumnanya ; Doppa, Janardhan_Rao ; Pande, Partha_Pratim

Publisher / Repository:: Association for Computing Machinery (ACM)

Date Published:: 2024-08-14

Journal Name:: ACM Transactions on Embedded Computing Systems

Volume:: 23

Issue:: 5

ISSN:: 1539-9087

Format(s):: Medium: X Size: p. 1-23

Size(s):: p. 1-23

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1145/3665279

More Like this