Single-Step Extraction of Transformer Attention with Dual-Gated Memtransistor Crossbars

Jayasinghe, Nethmi; Hashem, Maeesha Binte; Jayasuriya, Dinithi; Rahimifard, Leila; Kang, Min-A; Sangwan, Vinod K; Hersam, Mark C; Trivedi, Amit Ranjan

doi:10.1109/LED.2024.3435540

Citation Details

We discuss how a dual-gated memtransistor crossbar can accelerate the extraction of attention scores in Transformers. A memtransistor is a novel device based on two-dimensional materials that offers non-volatile programmability and gate tunability. Leveraging these attributes, we demonstrate the extraction of quadratic-order products on a single memtransistor and the single-step extraction of attention scores without inferring intermediate query/key vectors. This query/key-free, memtransistor-based attention scoring results in 2.37× lower energy with fewer than half the crossbar cells.
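The algebra behind query/key-free scoring can be illustrated with a minimal sketch. It assumes the standard unnormalized attention score Q Kᵀ with Q = X·W_q and K = X·W_k, and it assumes that "single-step" corresponds to folding the two projections into one matrix W_qk = W_q·W_kᵀ, so that each score becomes a quadratic form in the input embeddings. The abstract does not spell out this mapping, and all names and sizes below (`X`, `W_q`, `W_k`, `W_qk`, `n_tokens`, `d_model`, `d_head`) are hypothetical. The sketch checks only the numerical equivalence of the two routes; it does not model the memtransistor device, the crossbar, or the reported energy figures.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def rand_matrix(rows, cols, rng):
    """Random matrix with entries in [-1, 1]; illustrative data only."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
n_tokens, d_model, d_head = 4, 8, 3  # hypothetical sizes, not from the paper

X = rand_matrix(n_tokens, d_model, rng)   # input token embeddings
W_q = rand_matrix(d_model, d_head, rng)   # query projection (assumed)
W_k = rand_matrix(d_model, d_head, rng)   # key projection (assumed)

# Conventional route: infer Q and K first, then form Q K^T.
Q = matmul(X, W_q)
K = matmul(X, W_k)
scores_two_step = matmul(Q, transpose(K))

# Query/key-free route: fold both projections into one matrix, so each
# score s_ij = x_i W_qk x_j^T is a quadratic-order product of the inputs.
W_qk = matmul(W_q, transpose(W_k))
scores_single_step = matmul(matmul(X, W_qk), transpose(X))

# Largest elementwise disagreement between the two routes.
max_abs_diff = max(
    abs(a - b)
    for row_a, row_b in zip(scores_two_step, scores_single_step)
    for a, b in zip(row_a, row_b)
)
```

In this sketch the two routes agree up to floating-point rounding, so no intermediate Q or K needs to be materialized. Note that the folded matrix here is `d_model × d_model`; the cell-count and energy savings reported in the abstract concern the hardware realization and are not reproduced by this software illustration.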

Award ID(s): 2106964; 2317974

PAR ID: 10538088

Author(s) / Creator(s): Jayasinghe, Nethmi; Hashem, Maeesha Binte; Jayasuriya, Dinithi; Rahimifard, Leila; Kang, Min-A; Sangwan, Vinod K; Hersam, Mark C; Trivedi, Amit Ranjan

Publisher / Repository: IEEE

Date Published: 2024-10-01

Journal Name: IEEE Electron Device Letters

Volume: 45

ISSN: 0741-3106

Page Range / eLocation ID: 2005

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1109/LED.2024.3435540
