NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Exploring Algorithmic Design Choices for Low Latency CNN Deployment

https://doi.org/10.1109/HiPC62374.2024.00017

Li, Changxin; Kuppannagari, Sanmukh (December 2024, IEEE)

Free, publicly-accessible full text available December 18, 2025
A Framework to Enable Algorithmic Design Choice Exploration in DNNs

https://doi.org/10.1109/HPEC62836.2024.10938475

Cronin, Timothy L; Kuppannagari, Sanmukh (September 2024, IEEE)

Full Text Available
PARAG: PIM Architecture for Real-Time Acceleration of GCNs

https://doi.org/10.1109/HiPC58850.2023.00016

Singh, Gian; Kuppannagari, Sanmukh R; Vrudhula, Sarma (December 2023, IEEE)

Graph Convolutional Networks (GCNs) have successfully incorporated deep learning to graph structures for social network analysis, bio-informatics, etc. The execution pattern of GCNs is a hybrid of graph processing and neural networks which poses unique and significant challenges for hardware implementation. Graph processing involves a large amount of irregular memory access with little computation whereas processing of neural networks involves a large number of operations with regular memory access. Existing graph processing and neural network accelerators are therefore inefficient for computing GCNs. This paper presents Parag, processing in memory (PIM) architecture for GCN computation. It consists of customized logic with minuscule computing units called Neural Processing Elements (NPEs) interfaced to each bank of the DRAM to support parallel graph processing and neural network computation. It utilizes the massive internal parallelism of DRAM to accelerate the GCN execution with high energy efficiency. Simulation results for inference of GCN over standard datasets show a latency and energy reduction by three orders of magnitude over a CPU implementation. When compared to a state-of-the-art PIM architecture, PARAG achieves on an average 4x reduction in latency and 4.23x reduction in the energy-delay-product (EDP).
more » « less
Full Text Available
Bandwidth Efficient Homomorphic Encrypted Matrix Vector Multiplication Accelerator on FPGA

https://doi.org/10.1109/ICFPT56656.2022.9974369

Yang, Yang; Kuppannagari, Sanmukh R.; Kannan, Rajgopal; Prasanna, Viktor K. (December 2022, 2022 International Conference on Field-Programmable Technology (ICFPT))

Full Text Available
PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization

https://doi.org/10.1109/TPDS.2021.3134709

Meng, Yuan; Kuppannagari, Sanmukh; Kannan, Rajgopal; Prasanna, Viktor (September 2022, IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
NTTGen: a framework for generating low latency NTT implementations on FPGA

https://doi.org/10.1145/3528416.3530225

Yang, Yang; Kuppannagari, Sanmukh R.; Kannan, Rajgopal; Prasanna, Viktor K. (May 2022, 19th ACM International Conference on Computing Frontiers (CF ’22), 2022)

Full Text Available
FPGA Accelerator for Homomorphic Encrypted Sparse Convolutional Neural Network Inference

https://doi.org/10.1109/FCCM53951.2022.9786115

Yang, Yang; Kuppannagari, Sanmukh R.; Kannan, Rajgopal; Prasanna, Viktor K. (May 2022, IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2022)

Full Text Available
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

https://doi.org/10.1109/HiPC53243.2021.00014

Zhang, Chi; Kuppannagari, Sanmukh Rao; Prasanna, Viktor K (December 2021, 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC))

Full Text Available
How to Avoid Zero-Spacing in Fractionally-Strided Convolution? A Hardware-Algorithm Co-Design Methodology

https://doi.org/10.1109/HiPC53243.2021.00022

Meng, Yuan; Kuppannagari, Sanmukh; Kannan, Rajgopal; Prasanna, Viktor (December 2021, 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC))

Full Text Available
Efficient Neighbor-Sampling-based GNN Training on CPU-FPGA Heterogeneous Platform

https://doi.org/10.1109/HPEC49654.2021.9622822

Zhang, Bingyi; Kuppannagari, Sanmukh R.; Kannan, Rajgopal; Prasanna, Viktor (September 2021, IEEE High Performance Extreme Computing Conference, 2021)

Full Text Available

« Prev Next »

Search for: All records