NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CarbonSet: A Dataset to Analyze Trends and Benchmark the Sustainability of CPUs and GPUs

https://doi.org/10.1145/3716368.3735235

Hu, Jiajun; Choppali_Sudarshan, Chetan; Clifford, Maxwell; Chhabria, Vidya; Arora, Aman (June 2025, ACM)

Full Text Available
A Scalable and Energy-Efficient Processing-in-Memory Architecture for Gen-AI

https://doi.org/10.1109/JETCAS.2025.3566929

Singh, Gian; Vrudhula, Sarma (June 2025, IEEE Journal on Emerging and Selected Topics in Circuits and Systems)

Large language models (LLMs) have achieved high accuracy in diverse NLP and computer vision tasks due to self- attention mechanisms relying on GEMM and GEMV operations. However, scaling LLMs poses significant computational and energy challenges, particularly for traditional Von-Neumann architectures (CPUs/GPUs), which incur high latency and energy consumption from frequent data movement. These issues are even more pronounced in energy-constrained edge environments. While DRAM-based near-memory architectures offer improved energy efficiency and throughput, their processing elements are limited by strict area, power, and timing constraints. This work introduces CIDAN-3D, a novel Processing-in-Memory (PIM) architecture tailored for LLMs. It features an ultra-low-power Neuron Processing Element (NPE) with high compute density (#Operations/Area), enabling ecient in-situ execution of LLM operations by leveraging high parallelism within DRAM. CIDAN- 3D reduces data movement, improves locality, and achieves substantial gains in performance and energy efficiency—showing up to 1.3X higher throughput and 21.9X better energy efficiency for smaller models, and 3X throughput and 7X energy improvement for large decoder-only models compared to prior near-memory designs. As a result, CIDAN-3D offers a scalable, energy-efficient platform for LLM-driven Gen-AI applications.
more » « less
Full Text Available
Beyond the Surface: The Necessity for Detailed Metrics in Corporate Sustainability Reports

https://doi.org/10.1109/IGSC64514.2024.00035

Sudarshan, Chetan Choppali; Arora, Aman; Chhabria, Vidya A (November 2024, IEEE)

Full Text Available
A High Throughput, Energy-Efficient Architecture for Variable Precision Computing in DRAM

https://doi.org/10.1109/VLSI-SoC62099.2024.10767834

Singh, Gian; Dube, Ayushi; Vrudhula, Sarma (October 2024, IEEE)

Full Text Available
ML-INSIGHT: Machine Learning for Inrush Current Prediction and Power Switch Network Improvement

Gopalakrishnan, Vikram; Wu, Bing-Yue; Chhabria, Vidya A (August 2024, ACM)

Full Text Available
GreenFPGA: Evaluating FPGAs as Environmentally Sustainable Computing Solutions

Sudarshan, Choppalli Chetan; Arora, Aman; Chhabria, Vidya A (June 2024, ACM)

Full Text Available
ECO-CHIP: Estimation of Carbon Footprint of Chiplet-based Architectures for Sustainable VLSI

https://doi.org/10.1109/HPCA57654.2024.00058

Sudarshan, Chetan Choppali; Matkar, Nikhil; Vrudhula, Sarma; Sapatnekar, Sachin S; Chhabria, Vidya A (March 2024, IEEE)

Full Text Available

Search for: All records