NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

On the Origin of Holes During Polarization Reset in Floating Body Ferroelectric FETs Towards Improving Switching Efficiency

https://doi.org/10.1109/IEDM50854.2024.10873452

Jiang, Zhouhang; Xiao, Yi; Weling, Milind; Mulaosmanovic, Halid; Duenkel, Stefan; Kleimaier, Dominik; Soss, Steven; Beyer, Sven; Joshi, Rajiv; Mohamed, Mohamed; et al (December 2024, IEEE)

Full Text Available
Paving the Way for Pass Disturb-Free Vertical NAND Storage via a Dedicated and String-Compatible Pass Gate

https://doi.org/10.1021/acsami.4c08190

Zhao, Zijian; Woo, Sola; Aabrar, Khandker Akif; Kirtania, Sharadindu Gopal; Jiang, Zhouhang; Deng, Shan; Xiao, Yi; Mulaosmanovic, Halid; Duenkel, Stefan; Kleimaier, Dominik; et al (October 2024, ACS Applied Materials & Interfaces)

Full Text Available
Ferroelectric FET-based context-switching FPGA enabling dynamic reconfiguration for adaptive deep learning machines

https://doi.org/10.1126/sciadv.adk1525

Xu, Yixin; Zhao, Zijian; Xiao, Yi; Yu, Tongguang; Mulaosmanovic, Halid; Kleimaier, Dominik; Duenkel, Stefan; Beyer, Sven; Gong, Xiao; Joshi, Rajiv; et al (January 2024, Science Advances)

Field programmable gate array (FPGA) is widely used in the acceleration of deep learning applications because of its reconfigurability, flexibility, and fast time-to-market. However, conventional FPGA suffers from the trade-off between chip area and reconfiguration latency, making efficient FPGA accelerations that require switching between multiple configurations still elusive. Here, we propose a ferroelectric field-effect transistor (FeFET)–based context-switching FPGA supporting dynamic reconfiguration to break this trade-off, enabling loading of arbitrary configuration without interrupting the active configuration execution. Leveraging the intrinsic structure and nonvolatility of FeFETs, compact FPGA primitives are proposed and experimentally verified. The evaluation results show our design shows a 63.0%/74.7% reduction in a look-up table (LUT)/connection block (CB) area and 82.7%/53.6% reduction in CB/switch box power consumption with a minimal penalty in the critical path delay (9.6%). Besides, our design yields significant time savings by 78.7 and 20.3% on average for context-switching and dynamic reconfiguration applications, respectively.
more » « less
Full Text Available
Powering Disturb-Free Reconfigurable Computing and Tunable Analog Electronics with Dual-Port Ferroelectric FET

https://doi.org/10.1021/acsami.3c07827

Zhao, Zijian; Deng, Shan; Chatterjee, Swetaki; Jiang, Zhouhang; Islam, Muhammad Shaffatul; Xiao, Yi; Xu, Yixin; Meninger, Scott; Mohamed, Mohamed; Joshi, Rajiv; et al (November 2023, ACS Applied Materials & Interfaces)

Full Text Available
On the Write Schemes and Efficiency of FeFET 1T NOR Array for Embedded Nonvolatile Memory and Beyond

https://doi.org/10.1109/IEDM45625.2022.10019542

Xiao, Yi; Xu, Yixin; Jiang, Zhouhang; Deng, Shan; Zhao, Zijian; Mallick, Antik; Sun, Limeng; Joshi, Rajiv; Li, Xueqing; Shukla, Nikhil; et al (December 2022, IEDM)

Full Text Available
Hybrid RRAM/SRAM In-Memory Computing for Robust DNN Acceleration

https://doi.org/10.1109/TCAD.2022.3197516

Krishnan, Gokul; Wang, Zhenyu; Yeo, Injune; Yang, Li; Meng, Jian; Liehr, Maximilian; Joshi, Rajiv V.; Cady, Nathaniel C.; Fan, Deliang; Seo, Jae-sun; et al (August 2022, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit non-idealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. On the algorithm side, we develop a framework for the training of DNNs to support the hybrid IMC architecture through ensemble learning. The proposed framework performs quantization (weights and activations), pruning, RRAM IMC-aware training, and employs ensemble learning through different compensation scales by utilizing the programmable shifter. Finally, we design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows that the SRAM compensation allows for a realistic IMC architecture with multi-level RRAM cells (MLC) even though they suffer from high variations. The hybrid IMC architecture achieves up to 21.9%, 12.65%, and 6.52% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead, for ResNet-20 on CIFAR-10, VGG-16 on CIFAR-10, and ResNet-18 on ImageNet, respectively.
more » « less
Full Text Available
Accurate Inference With Inaccurate RRAM Devices: A Joint Algorithm-Design Solution

https://doi.org/10.1109/JXCDC.2020.2987605

Charan, Gouranga; Mohanty, Abinash; Du, Xiaocong; Krishnan, Gokul; Joshi, Rajiv V.; Cao, Yu (June 2020, IEEE Journal on Exploratory Solid-State Computational Devices and Circuits)

Search for: All records