This content will become publicly available on August 15, 2026

Title: Compute-in-Memory Circuits and Architectures for Efficient Acceleration of AI and Data Centric Workloads
This dissertation introduces a series of digital CIM circuits and architectures that significantly improve power, performance, and area (PPA) metrics for data-intensive workloads. It begins with a programmable CIM design that balances the flexibility of central processing units (CPUs) and graphics processing units (GPUs) with the efficiency of ASICs, enabling a broad class of applications. A prototype 28nm CMOS chip is then presented to accelerate general matrix-matrix multiplications (GEMMs) across various fixed-point precisions. The focus then shifts to sparse GEMM acceleration. The first design demonstrates how CIM tailored for channel decoders leverages both fixed and unstructured sparsity to outperform conventional designs. The second design, fabricated in 28nm CMOS, supports diverse unstructured sparse formats and integer precisions, efficiently targeting highly sparse deep neural networks (DNNs). The final design achieves state-of-the-art efficiency in compressed sparse GEMMs, supporting both integer and floating-point data types on shared hardware. It also integrates a RISC-V CPU to manage computation across diverse matrix sizes and model types. Together, these contributions advance CIM as a scalable and efficient platform for future AI and data-centric systems.
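To make the fixed-point GEMM target concrete, here is a minimal NumPy sketch (illustrative only, not the chip's dataflow; the per-tensor quantization scales sa and sb are assumptions) showing int8 operands accumulated in int32 with a single rescale at the output:

```python
import numpy as np

def int8_gemm(A_q, B_q, scale_a, scale_b):
    """Fixed-point GEMM sketch: int8 operands, int32 accumulation,
    one floating-point rescale at the output (per-tensor scales assumed)."""
    acc = A_q.astype(np.int32) @ B_q.astype(np.int32)    # integer MACs
    return acc.astype(np.float32) * (scale_a * scale_b)  # dequantize the result

# Example: quantize random matrices to int8 and compare against float GEMM
A = np.random.randn(16, 64).astype(np.float32)
B = np.random.randn(64, 32).astype(np.float32)
sa, sb = np.abs(A).max() / 127, np.abs(B).max() / 127
A_q = np.round(A / sa).astype(np.int8)
B_q = np.round(B / sb).astype(np.int8)
print(np.abs(int8_gemm(A_q, B_q, sa, sb) - A @ B).max())  # quantization error only
```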
Award ID(s):
2314591 2505326 2528767 2528723
PAR ID:
10653075
Author(s) / Creator(s):
Publisher / Repository:
Arizona State University
Date Published:
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. In-memory computing (IMC) provides energy-efficient solutions for deep neural networks (DNNs). Most IMC designs for DNNs employ fixed-point precision. However, floating-point precision is still required for DNN training and complex inference models to maintain high accuracy. There have been no floating-point IMC works in the literature in which the floating-point computation is immersed into the weight memory storage. In this work, we propose a novel floating-point IMC macro with a configurable architecture that supports both normal 8-bit floating point (FP8) and 8-bit block floating point (BF8) with a shared exponent. The proposed FP-IMC macro, implemented in 28nm CMOS, demonstrates 12.1 TOPS/W for FP8 precision and 66.6 TOPS/W for BF8 precision, improving energy efficiency beyond state-of-the-art FP IMC macros.
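To illustrate the shared-exponent idea behind block floating point, here is a minimal sketch (an assumed encoding for illustration; the macro's actual FP8/BF8 formats, mantissa width, and rounding are not specified here) in which each block stores one exponent plus signed 8-bit mantissas, so dot products reduce to integer MACs with one final rescale:

```python
import numpy as np

def to_bf8_block(x, mant_bits=7):
    """Hypothetical BF8 encoding: one shared exponent per block,
    signed mantissas quantized to mant_bits (sketch only)."""
    shared_exp = int(np.ceil(np.log2(np.max(np.abs(x)) + 1e-12)))
    scale = 2.0 ** (shared_exp - mant_bits)
    mant = np.clip(np.round(x / scale), -(2**mant_bits), 2**mant_bits - 1).astype(np.int32)
    return mant, scale

def bf8_dot(a_block, b_block):
    ma, sa = to_bf8_block(a_block)
    mb, sb = to_bf8_block(b_block)
    # Integer multiply-accumulate inside the array; one FP rescale at the end.
    return int(np.dot(ma, mb)) * sa * sb

x = np.random.randn(64).astype(np.float32)
w = np.random.randn(64).astype(np.float32)
print(bf8_dot(x, w), float(np.dot(x, w)))   # close to the full-precision dot product
```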
  2. Realizing increasingly complex artificial intelligence (AI) functionalities directly on edge devices calls for unprecedented energy efficiency of edge hardware. Compute-in-memory (CIM) based on resistive random-access memory (RRAM) [1] promises to meet such demand by storing AI model weights in dense, analogue and non-volatile RRAM devices, and by performing AI computation directly within RRAM, thus eliminating power-hungry data movement between separate compute and memory [2–5]. Although recent studies have demonstrated in-memory matrix-vector multiplication on fully integrated RRAM-CIM hardware [6–17], it remains a goal for a RRAM-CIM chip to simultaneously deliver high energy efficiency, versatility to support diverse models and software-comparable accuracy. Although efficiency, versatility and accuracy are all indispensable for broad adoption of the technology, the inter-related trade-offs among them cannot be addressed by isolated improvements on any single abstraction level of the design. Here, by co-optimizing across all hierarchies of the design, from algorithms and architecture to circuits and devices, we present NeuRRAM, a RRAM-based CIM chip that simultaneously delivers versatility in reconfiguring CIM cores for diverse model architectures, energy efficiency twice that of previous state-of-the-art RRAM-CIM chips across various computational bit-precisions, and inference accuracy comparable to software models quantized to four-bit weights across various AI tasks, including 99.0% accuracy on MNIST [18] and 85.7% on CIFAR-10 [19] image classification, 84.7% accuracy on Google speech command recognition [20], and a 70% reduction in image-reconstruction error on a Bayesian image-recovery task.
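A minimal idealized model of the in-memory matrix-vector multiplication described above might look like the sketch below (assumptions: ideal linear cells, a single conductance range g_on, and no device noise, wire resistance, or ADC effects):

```python
import numpy as np

def crossbar_mvm(weights, v_in, bits=4, g_on=1e-4):
    """Idealized RRAM crossbar MVM sketch (assumed model, not NeuRRAM's circuit):
    weights are quantized to `bits` levels and mapped to cell conductances,
    inputs are applied as word-line voltages, and the bit-line currents,
    summed by Kirchhoff's law, give the matrix-vector product."""
    levels = 2**(bits - 1) - 1
    w_q = np.clip(np.round(weights / np.abs(weights).max() * levels), -levels, levels)
    g = w_q * (g_on / levels)   # signed conductance (a differential cell pair in practice)
    i_out = g.T @ v_in          # column currents = analog dot products
    return i_out

x = np.random.rand(128)         # input activations applied as voltages
W = np.random.randn(128, 64)    # layer weights
print(crossbar_mvm(W, x).shape) # (64,) output currents
```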
  3. Deep neural networks (DNNs) have experienced unprecedented success in a variety of cognitive tasks, which has driven a push to deploy DNNs on edge devices. DNNs are usually composed of multiply-and-accumulate (MAC) operations and are both data and compute intensive. In-memory computing (IMC) methodologies have shown significant energy-efficiency and throughput benefits for DNN workloads by reducing data movement and eliminating memory reads. Weight pruning in DNNs can further improve the energy/throughput of DNN hardware through reduced storage and compute. Unlike their ASIC counterparts, recent IMC works [1]–[3], [6] have not explored such sparse compression techniques to enable storage benefits and compute skipping. A recent work [4] attempted to exploit this by compressing weights using a binary map and a custom compression format. This is sub-optimal because the implementation requires a complex routing mechanism (butterfly routing), additional compute to decode compressed weights, and has limited flexibility in supporting different sparse encodings. Fig. 1 illustrates our motivations, the challenges of implementing weight compression in digital IMC designs, and the need for a new methodology that enables sparse compute directly on compressed weights. In this work, we present a novel sparsity-integrated IMC (SP-IMC) macro in 28nm CMOS which, for the first time, utilizes three popular sparse compression formats, i.e., coordinate representation (COO), run-length encoding (RL) and N:M sparsity [7], all along the matrix column direction and with tunable precisions. SP-IMC stores and directly processes the sparse compressed weights in the macro, achieving higher storage density, a reduction in re-write operations to the macro, and higher overall energy efficiency.
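For reference, the three column-wise compression formats mentioned above can be sketched as follows (a simplified software view; the macro's actual bit-level encodings and tunable precisions are not reproduced here, and the n and m values are illustrative):

```python
import numpy as np

def encode_coo(col):
    """Coordinate format: (row index, value) pairs for the nonzeros of one column."""
    idx = np.flatnonzero(col)
    return [(int(i), float(col[i])) for i in idx]

def encode_rl(col):
    """Run-length format: (zero-run length, value) pairs walking down the column."""
    out, run = [], 0
    for v in col:
        if v == 0:
            run += 1
        else:
            out.append((run, float(v)))
            run = 0
    return out

def encode_n_m(col, n=2, m=4):
    """N:M format: per group of m entries, positions and values of up to n nonzeros."""
    groups = []
    for g in col.reshape(-1, m):                  # assumes len(col) % m == 0
        idx = np.flatnonzero(g)[:n]
        groups.append((idx.tolist(), g[idx].tolist()))
    return groups

col = np.array([0, 3, 0, 0, 0, 0, 5, 1], dtype=float)
print(encode_coo(col), encode_rl(col), encode_n_m(col))
```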
  4. This paper presents a ferroelectric FET (FeFET) based processing-in-memory (PIM) architecture to accelerate inference of deep neural networks (DNNs). We propose a digital in-memory vector-matrix multiplication (VMM) engine utilizing the FeFET crossbar to enable bit-parallel computation and eliminate the analog-to-digital conversion required in prior mixed-signal PIM designs. A dedicated hierarchical network-on-chip (H-NoC) is developed for input broadcasting and on-the-fly partial-result processing, reducing data transmission volume and latency. Simulations in 28nm CMOS technology show 115x and 6.3x higher computing efficiency (GOPS/W) over a desktop GPU (Nvidia GTX 1080Ti) and a ReRAM-based design, respectively.
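One way to picture an ADC-free, bit-decomposed digital VMM of this kind is the sketch below (assumed unsigned activations and signed integer weights; one possible bit-serial dataflow, not necessarily the paper's exact FeFET scheme):

```python
import numpy as np

def digital_vmm_bitplanes(x_uint, W_int, x_bits=8):
    """Digital in-memory VMM sketch: input bits are streamed one plane at a
    time, each plane gates the stored weight columns, and partial sums are
    combined with shift-and-add entirely in the digital domain (no ADC)."""
    acc = np.zeros(W_int.shape[1], dtype=np.int64)
    for b in range(x_bits):
        plane = (x_uint >> b) & 1          # one bit of every input element
        acc += (plane @ W_int) << b        # gated column sums, weighted by 2^b
    return acc

x = np.random.randint(0, 256, size=64, dtype=np.uint8)   # unsigned 8-bit activations
W = np.random.randint(-8, 8, size=(64, 32))              # signed 4-bit weights
print(np.array_equal(digital_vmm_bitplanes(x, W), x.astype(np.int64) @ W))  # True
```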
  5. The deep neural network (DNN) has emerged as the most important and popular artificial intelligence (AI) technique. The growth of model size poses a key energy-efficiency challenge for the underlying computing platform, so model compression becomes a crucial problem. However, current approaches are limited by various drawbacks. Specifically, the network sparsification approach suffers from irregularity, its heuristic nature and large indexing overhead. On the other hand, the recent structured-matrix-based approach (i.e., CIRCNN) is limited by relatively complex arithmetic computation (i.e., FFT), a less flexible compression ratio, and its inability to fully utilize input sparsity. To address these drawbacks, this paper proposes PERMDNN, a novel approach to generate and execute hardware-friendly structured sparse DNN models using permuted diagonal matrices. Compared with the unstructured sparsification approach, PERMDNN eliminates the drawbacks of indexing overhead, heuristic compression effects and time-consuming retraining. Compared with the circulant structure-imposing approach, PERMDNN enjoys the benefits of higher reduction in computational complexity, flexible compression ratio, simple arithmetic computation and full utilization of input sparsity. We propose the PERMDNN architecture, a multi-processing-element (PE) computing engine targeting fully connected (FC) layers. The entire architecture is highly scalable and flexible, and hence can support the needs of different applications with different model configurations. We implement a 32-PE design in 28nm CMOS technology. Compared with EIE, PERMDNN achieves 3.3x-4.8x higher throughput, 5.9x-8.5x better area efficiency and 2.8x-4.0x better energy efficiency on different workloads. Compared with CIRCNN, PERMDNN achieves 11.51x higher throughput and 3.89x better energy efficiency.
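As a structural illustration of the permuted-diagonal idea (a software sketch only; the block size, permutations, and block grid here are made-up examples, not the PERMDNN hardware mapping), each block stores just a diagonal vector and a permutation, so a dense FC block needs only O(n) parameters instead of O(n^2):

```python
import numpy as np

def permdiag_matvec(diag_vals, perm, x):
    """One permuted-diagonal block: y = P @ (D @ x), with D stored as a vector
    and P stored as an index permutation."""
    return (diag_vals * x)[perm]

n, blocks_per_dim = 8, 2
rng = np.random.default_rng(0)
# A grid of permuted-diagonal blocks standing in for a dense FC weight matrix.
row_blocks = [[(rng.standard_normal(n), rng.permutation(n))
               for _ in range(blocks_per_dim)] for _ in range(blocks_per_dim)]
x_blocks = [rng.standard_normal(n) for _ in range(blocks_per_dim)]
# Each output block sums the contributions of every input block in its row.
y_blocks = [sum(permdiag_matvec(d, p, xb) for (d, p), xb in zip(row, x_blocks))
            for row in row_blocks]
print([yb.shape for yb in y_blocks])   # two length-8 output blocks
```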