NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

FP-IMC: A 28nm All-Digital Configurable Floating-Point In-Memory Computing Macro

https://doi.org/10.1109/ESSCIRC59616.2023.10268770

Saikia, Jyotishman; Sridharan, Amitesh; Yeo, Injune; Venkataramanaiah, Shreyas; Fan, Deliang; Seo, Jae-Sun (September 2023, IEEE European Solid State Circuits Conference (ESSCIRC))
A 65nm RRAM Compute-in-Memory Macro for Genome Sequencing Alignment

https://doi.org/10.1109/ESSCIRC59616.2023.10268783

Zhang, Fan; He, Wangxin; Yeo, Injune; Liehr, Maximilian; Cady, Nathaniel; Cao, Yu; Seo, Jae-Sun; Fan, Deliang (September 2023, IEEE European Solid State Circuits Conference (ESSCIRC))
A 28-nm 8-bit Floating-Point Tensor Core-Based Programmable CNN Training Processor With Dynamic Structured Sparsity

https://doi.org/10.1109/JSSC.2023.3269148

Venkataramanaiah, Shreyas Kolala; Meng, Jian; Suh, Han-Sok; Yeo, Injune; Saikia, Jyotishman; Cherupally, Sai Kiran; Zhang, Yichi; Zhang, Zhiru; Seo, Jae-Sun (July 2023, IEEE Journal of Solid-State Circuits)

Full Text Available
Algorithm-hardware Co-optimization for Energy-efficient Drone Detection on Resource-constrained FPGA

https://doi.org/10.1145/3583074

Suh, Han-Sok; Meng, Jian; Nguyen, Ty; Kumar, Vijay; Cao, Yu; Seo, Jae-Sun (June 2023, ACM Transactions on Reconfigurable Technology and Systems)

Convolutional neural network (CNN)-based object detection has achieved very high accuracy; e.g., single-shot multi-box detectors (SSDs) can efficiently detect and localize various objects in an input image. However, they require a high amount of computation and memory storage, which makes it difficult to perform efficient inference on resource-constrained hardware devices such as drones or unmanned aerial vehicles (UAVs). Drone/UAV detection is an important task for applications including surveillance, defense, and multi-drone self-localization and formation control. In this article, we designed and co-optimized an algorithm and hardware for energy-efficient drone detection on resource-constrained FPGA devices. We trained an SSD object detection algorithm with a custom drone dataset. For inference, we employed low-precision quantization and adapted the width of the SSD CNN model. To improve throughput, we use dual-data rate operations for DSPs to effectively double the throughput with limited DSP counts. For different SSD algorithm models, we analyze accuracy or mean average precision (mAP) and evaluate the corresponding FPGA hardware utilization, DRAM communication, and throughput optimization. We evaluated the FPGA hardware for a custom drone dataset, Pascal VOC, and COCO2017. Our proposed design achieves a high mAP of 88.42% on the multi-drone dataset, with a high energy efficiency of 79 GOPS/W and throughput of 158 GOPS using the Xilinx Zynq ZU3EG FPGA device on the Open Vision Computer version 3 (OVC3) platform. Our design achieves 1.1 to 8.7× higher energy efficiency than prior works that used the same Pascal VOC dataset, using the same FPGA device, but at a low-power consumption of 2.54 W. For the COCO dataset, our MobileNet-V1 implementation achieved an mAP of 16.8, and 4.9 FPS/W for energy-efficiency, which is ∼ 1.9× higher than prior FPGA works or other commercial hardware platforms.
more » « less
Full Text Available
PRIVE: Efficient RRAM Programming with Chip Verification for RRAM-based In-Memory Computing Acceleration

https://doi.org/10.23919/DATE56975.2023.10137266

He, Wangxin; Meng, Jian; Gonugondla, Sujan Kumar; Yu, Shimeng; Shanbhag, Naresh R; Seo, Jae-sun (April 2023, Design, Automation & Test in Europe Conference & Exhibition (DATE))

Full Text Available
A 92 F^2/ bit Physically Unclonable Function Exploiting Channel Charge Injection and Mismatch Accumulation

https://doi.org/10.1109/CICC57935.2023.10121230

Yeo, Injune; Jee, Dong-Woo; Seo, Jae-Sun (April 2023, IEEE Custom Integrated Circuits Conference (CICC))

Full Text Available

Search for: All records