Spiking Transformer Hardware Accelerators in 3D Integration

Xu, Boxun; Hwang, Junyoung; Vanna-iampikul, Pruek; Lim, Sung Kyu; Li, Peng

Spiking neural networks (SNNs) are powerful models of spatiotemporal computation and are well suited for deployment on resource-constrained edge devices and neuromorphic hardware due to their low power consumption. Leveraging attention mechanisms similar to those found in their artificial neural network counterparts, recently emerged spiking transformers have showcased promising performance and efficiency by capitalizing on the binary nature of spiking operations. Recognizing the current lack of dedicated hardware support for spiking transformers, this paper presents the first work on 3D spiking transformer hardware architecture and design methodology. We present an architecture and physical design co-optimization approach tailored specifically for spiking transformers. Through memory-on-logic and logic-on-logic stacking enabled by 3D integration, we demonstrate significant energy and delay improvements compared to conventional 2D CMOS integration.

Award ID(s): 2310170, 1948201

PAR ID: 10570164

Publisher / Repository: IEEE/ACM International Conference on Computer-Aided Design (ICCAD ’24)

Date Published: 2024-10-27

ISBN: 979-8-4007-1077-3

Location: New York, NY

Sponsoring Org: National Science Foundation
