NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A compute-in-memory chip based on resistive random-access memory

https://doi.org/10.1038/s41586-022-04992-8

Wan, Weier; Kubendran, Rajkumar; Schaefer, Clemens; Eryilmaz, Sukru Burc; Zhang, Wenqiang; Wu, Dabin; Deiss, Stephen; Raina, Priyanka; Qian, He; Gao, Bin; et al (August 2022, Nature)

Abstract Realizing increasingly complex artificial intelligence (AI) functionalities directly on edge devices calls for unprecedented energy efficiency of edge hardware. Compute-in-memory (CIM) based on resistive random-access memory (RRAM) 1 promises to meet such demand by storing AI model weights in dense, analogue and non-volatile RRAM devices, and by performing AI computation directly within RRAM, thus eliminating power-hungry data movement between separate compute and memory 2–5 . Although recent studies have demonstrated in-memory matrix-vector multiplication on fully integrated RRAM-CIM hardware 6–17 , it remains a goal for a RRAM-CIM chip to simultaneously deliver high energy efficiency, versatility to support diverse models and software-comparable accuracy. Although efficiency, versatility and accuracy are all indispensable for broad adoption of the technology, the inter-related trade-offs among them cannot be addressed by isolated improvements on any single abstraction level of the design. Here, by co-optimizing across all hierarchies of the design from algorithms and architecture to circuits and devices, we present NeuRRAM—a RRAM-based CIM chip that simultaneously delivers versatility in reconfiguring CIM cores for diverse model architectures, energy efficiency that is two-times better than previous state-of-the-art RRAM-CIM chips across various computational bit-precisions, and inference accuracy comparable to software models quantized to four-bit weights across various AI tasks, including accuracy of 99.0 percent on MNIST 18 and 85.7 percent on CIFAR-10 19 image classification, 84.7-percent accuracy on Google speech command recognition 20 , and a 70-percent reduction in image-reconstruction error on a Bayesian image-recovery task.
more » « less
Full Text Available
Hierarchical Multicast Network-On-Chip for Scalable Reconfigurable Neuromorphic Systems

https://doi.org/10.1109/ISCAS48785.2022.9937961

Hota, Gopabandhu; Mysore, Nishant; Deiss, Stephen; Pedroni, Bruno; Cauwenberghs, Gert (May 2022, 2022 IEEE Int. Symp. Circuits and Systems (ISCAS’2022))

Full Text Available
Hierarchical Network Connectivity and Partitioning for Reconfigurable Large-Scale Neuromorphic Systems

https://doi.org/10.3389/fnins.2021.797654

Mysore, Nishant; Hota, Gopabandhu; Deiss, Stephen R.; Pedroni, Bruno U.; Cauwenberghs, Gert (January 2022, Frontiers in Neuroscience)

We present an efficient and scalable partitioning method for mapping large-scale neural network models with locally dense and globally sparse connectivity onto reconfigurable neuromorphic hardware. Scalability in computational efficiency, i.e., amount of time spent in actual computation, remains a huge challenge in very large networks. Most partitioning algorithms also struggle to address the scalability in network workloads in finding a globally optimal partition and efficiently mapping onto hardware. As communication is regarded as the most energy and time-consuming part of such distributed processing, the partitioning framework is optimized for compute-balanced, memory-efficient parallel processing targeting low-latency execution and dense synaptic storage, with minimal routing across various compute cores. We demonstrate highly scalable and efficient partitioning for connectivity-aware and hierarchical address-event routing resource-optimized mapping, significantly reducing the total communication volume recursively when compared to random balanced assignment. We showcase our results working on synthetic networks with varying degrees of sparsity factor and fan-out, small-world networks, feed-forward networks, and a hemibrain connectome reconstruction of the fruit-fly brain. The combination of our method and practical results suggest a promising path toward extending to very large-scale networks and scalable hardware-aware partitioning.
more » « less
Full Text Available
Markov Chain Abstractions of Electrochemical Reaction-Diffusion in Synaptic Transmission for Neuromorphic Computing

https://doi.org/10.3389/fnins.2021.698635

Wagner, Margot; Bartol, Thomas M.; Sejnowski, Terrence J.; Cauwenberghs, Gert (November 2021, Frontiers in Neuroscience)

Progress in computational neuroscience toward understanding brain function is challenged both by the complexity of molecular-scale electrochemical interactions at the level of individual neurons and synapses and the dimensionality of network dynamics across the brain covering a vast range of spatial and temporal scales. Our work abstracts an existing highly detailed, biophysically realistic 3D reaction-diffusion model of a chemical synapse to a compact internal state space representation that maps onto parallel neuromorphic hardware for efficient emulation at a very large scale and offers near-equivalence in input-output dynamics while preserving biologically interpretable tunable parameters.
more » « less
Full Text Available
Brain-Inspired Learning on Neuromorphic Substrates

https://doi.org/10.1109/JPROC.2020.3045625

Zenke, Friedemann; Neftci, Emre O. (May 2021, Proceedings of the IEEE)
Design Principles of Large-Scale Neuromorphic Systems Centered on High Bandwidth Memory

https://doi.org/10.1109/ICRC2020.2020.00013

Pedroni, Bruno U.; Deiss, Stephen R.; Mysore, Nishant; Cauwenberghs, Gert (December 2020, 2020 IEEE International Conference on Rebooting Computing (ICRC’2020))
null (Ed.)
Full Text Available
A 1.52 pJ/Spike Reconfigurable Multimodal Integrate-and-Fire Neuron Array Transceiver

https://doi.org/10.1145/3407197.3407209

Kubendran, Rajkumar; Wan, Weier; Joshi, Siddharth; Wong, H.-S. Philip; Cauwenberghs, Gert (July 2020, 2020 ACM Int. Conf. on Neuromorphic Systems (ICONS’2020))
null (Ed.)
Full Text Available
Neuromorphic Dynamical Synapses with Reconfigurable Voltage-Gated Kinetics

https://doi.org/10.1109/TBME.2019.2948809

Wang, Jun; Cauwenberghs, Gert; Broccard, Frederic D. (July 2020, IEEE Transactions on Biomedical Engineering)
null (Ed.)
Full Text Available
Synaptic Plasticity Dynamics for Deep Continuous Local Learning (DECOLLE)

https://doi.org/10.3389/fnins.2020.00424

Kaiser, Jacques; Mostafa, Hesham; Neftci, Emre (May 2020, Frontiers in Neuroscience)

Full Text Available
The Open EEGLAB Portal Interface:High-Performance Computing with EEGLAB

https://doi.org/10.1016/j.neuroimage.2020.116778

Martínez-Cancino, Ramón; Delorme, Arnaud; Truong, Dung; Artoni, Fiorenzo; Kreutz-Delgado, Kenneth; Sivagnanam, Subhashini; Yoshimoto, Kenneth; Majumdar, Amitava; Makeig, Scott (April 2020, NeuroImage)

Full Text Available

« Prev Next »

Search for: All records