NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A scalable reinforcement learning framework inspired by hippocampal memory mechanisms for efficient contextual and sequential decision making

https://doi.org/10.1038/s41598-025-10586-x

Poursiami, Hamed; Moshruba, Ayana; Cooper, Keiland_W; Gobin, Derek; Kaiser, Md_Abdullah-Al; Singh, Ankur; Noor, Rouhan; Shahbaba, Babak; Jaiswal, Akhilesh; Fortin, Norbert_J; et al (July 2025, Scientific Reports)
Retina-inspired Object Motion Segmentation for Event-Cameras

https://doi.org/10.1109/NICE65350.2025.11065149

Clerico, Victoria; Snyder, Shay; Lohia, Arya; Kaiser, Md Abdullah-Al; Schwartz, Gregory; Jaiswal, Akhilesh; Parsa, Maryam (March 2025, IEEE)

Event-cameras have emerged as a revolutionary technology with a high temporal resolution that far surpasses standard active pixel cameras. This technology draws biological inspiration from photoreceptors and the initial retinal synapse. This research showcases the potential of additional retinal functionalities to extract visual features. We provide a domain-agnostic and efficient algorithm for ego-motion compensation based on Object Motion Sensitivity (OMS), one of the multiple features computed within the mammalian retina. We develop a method based on experimental neuroscience that translates OMS’ biological circuitry to a low-overhead algorithm to suppress camera motion bypassing the need for deep networks and learning. Our system processes event data from dynamic scenes to perform pixel-wise object motion segmentation using a real and synthetic dataset. This paper introduces a bio-inspired computer vision method that dramatically reduces the number of parameters by 10^3 to 10^6 orders of magnitude compared to previous approaches. Our work paves the way for robust, high-speed, and low-bandwidth decision-making for in-sensor computations.
more » « less
Free, publicly-accessible full text available March 25, 2026
Toward High-Accuracy, Programmable Extreme-Edge Intelligence for Neuromorphic Vision Sensors utilizing Magnetic Domain Wall Motion-based MTJ

https://doi.org/10.1145/3649329.3657359

Kaiser, Md Abdullah-Al; Datta, Gourav; Beerel, Peter A; Jaiswal, Akhilesh R (June 2024, ACM)

Full Text Available
Performance Modeling Sparse MTTKRP Using Optical Static Random Access Memory on FPGA

https://doi.org/10.1109/HPEC55821.2022.9926407

Wijeratne, Sasindu; Jaiswal, Akhilesh; Jacob, Ajey P.; Zhang, Bingyi; Prasanna, Viktor (September 2022, IEEE High Performance Extreme Computing Conference)

Full Text Available
Modeling the Energy Efficiency of GEMM using Optical Random Access Memory

https://doi.org/10.1109/HPEC55821.2022.9926291

Zhang, Bingyi; Jaiswal, Akhilesh; Mathew, Clynn; Lakkireddy, Ravi Teja; Jacob, Ajey P.; Wijeratne, Sasindu; Prasanna, Viktor (September 2022, High Performance Extreme Computing Conference)

Full Text Available
Heterogeneously Integrated Quantum Chip Interposer Packaging

https://doi.org/10.1109/ECTC51906.2022.00294

Kudalippalliyalil, Ramesh; Chandran, Sujith; Jaiswal, Akhilesh; Wang, Kang L.; Jacob, Ajey P. (June 2022, 2022 IEEE 72nd Electronic Components and Technology Conference (ECTC))

Quantum computers provide faster solutions to specific compute-intensive classical problems. However, building a fault-tolerant quantum computer architecture is challenging and demands integrating several qubits with optimized signal routing while maintaining its quantum coherence. Experimental realization of such quantum computers with diverse functional components in a planar monolithic device architecture is challenging due to material and thermodynamic mismatch between various elements. Furthermore, it requires complex control and routing, resulting in parasitic modes and reduced qubit coherence. Thus, a scalable interposer architecture is essential to merge and interconnect different functionalities within a sophisticated chip while maintaining qubit coherence. As such, heterogeneous integration is an optimum solution to scale the qubit technology. We propose a heterogeneously integrated quantum chip optoelectronics interposer as a solution to the high-density scalable qubit architecture. Our technology is high-volume manufacturable and provides novel optical I/O solutions for on-chip, chip-to-chip, and cryogenic-to-outside world interconnect.
more » « less
Full Text Available
ACE-SNN: Algorithm-Hardware Co-design of Energy-Efficient & Low-Latency Deep Spiking Neural Networks for 3D Image Recognition

https://doi.org/10.3389/fnins.2022.815258

Datta, Gourav; Kundu, Souvik; Jaiswal, Akhilesh R.; Beerel, Peter A. (April 2022, Frontiers in Neuroscience)

High-quality 3D image recognition is an important component of many vision and robotics systems. However, the accurate processing of these images requires the use of compute-expensive 3D Convolutional Neural Networks (CNNs). To address this challenge, we propose the use of Spiking Neural Networks (SNNs) that are generated from iso-architecture CNNs and trained with quantization-aware gradient descent to optimize their weights, membrane leak, and firing thresholds. During both training and inference, the analog pixel values of a 3D image are directly applied to the input layer of the SNN without the need to convert to a spike-train. This significantly reduces the training and inference latency and results in high degree of activation sparsity, which yields significant improvements in computational efficiency. However, this introduces energy-hungry digital multiplications in the first layer of our models, which we propose to mitigate using a processing-in-memory (PIM) architecture. To evaluate our proposal, we propose a 3D and a 3D/2D hybrid SNN-compatible convolutional architecture and choose hyperspectral imaging (HSI) as an application for 3D image recognition. We achieve overall test accuracy of 98.68, 99.50, and 97.95% with 5 time steps (inference latency) and 6-bit weight quantization on the Indian Pines, Pavia University, and Salinas Scene datasets, respectively. In particular, our models implemented using standard digital hardware achieved accuracies similar to state-of-the-art (SOTA) with ~560.6× and ~44.8× less average energy than an iso-architecture full-precision and 6-bit quantized CNN, respectively. Adopting the PIM architecture in the first layer, further improves the average energy, delay, and energy-delay-product (EDP) by 30, 7, and 38%, respectively.
more » « less
Full Text Available
Hardware-Algorithm Re-Engineering of Retinal Circuit for Intelligent Object Motion Segmentation

https://doi.org/10.1109/ICONS62911.2024.00045

Sinaga, Jason; Clerico, Victoria; Abdullah-AI_Kaiser, Md; Snyder, Shay; Lohia, Arya; Schwartz, Gregory; Parsa, Maryam; Jaiswal, Akhilesh (July 2024, IEEE)

Recent advances in retinal neuroscience have fueled various hardware and algorithmic efforts to develop retina- inspired solutions for computer vision tasks. In this work, we focus on a fundamental visual feature within the mammalian retina, Object Motion Sensitivity (OMS). Using DVS data from EV-IMO dataset, we analyze the performance of an algorithmic implementation of OMS circuitry for motion segmentation in presence of ego-motion. This holistic analysis considers the underlying constraints arising from the hardware circuit implementation. We present novel CMOS circuits that implement OMS functionality inside image sensors, while providing run-time re-configurability for key algorithmic parameters. In-sensor technologies for dynamical environment adaptation are crucial for ensuring high system performance. Finally, we verify the functionality and re-configurability of the proposed CMOS circuit designs through Cadence simulations in 180nm technology. In summary, the presented work lays foundation for hardware- algorithm re-engineering of known biological circuits to suit application needs.
more » « less
Full Text Available

Search for: All records