NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

INTIACC: A Programmable Floating-Point Accelerator for Partial Differential Equations

https://doi.org/10.1109/JSSC.2024.3379308

Huang, Paul Xuanyuanliang; Tsividis, Yannis; Seok, Mingoo (January 2024, IEEE Journal of Solid-State Circuits)
Mathew, Sanu (Ed.)
This article presents a 32-bit floating-point (FP32) programmable accelerator for solving a wide range of partial differential equations (PDEs) based on numerical integration methods. Compared to prior works that have fixed-point systems and are only applicable to specific types of PDEs, our proposed, integration accelerator for PDEs, named INTIACC, accelerator consists of 16 locally interconnected processing elements (PEs) where each PE is a fully programmable reduced instruction set computer (RISC) processor with an FP32 arithmetic logic unit (FP32 ALU) and a custom-designed instruction set architecture (ISA). These features enable INTIACC to generate solutions with high precision and a wide dynamic range and also allow users to implement different numerical algorithms to perform high-order integration methods and to evaluate nonlinear functions. In addition, we create a novel slow-global-fast-local clocking scheme in which PEs operate asynchronously with each other most of the time. We prototype the INTIACC test chip in 65 nm, with a core area of 0.975 mm2. Running at an average local clock frequency of 570 MHz at 1 V, it offers a single-precision computation throughput of 9.12 GFLOPS. Testing results show that with a similar energy-delay product, INTIACC is up to 40× faster than the prior state-of-the-art PDE solver.
more » « less
Full Text Available
iMCU: A 28-nm Digital In-Memory Computing-Based Microcontroller Unit for TinyML

https://doi.org/10.1109/JSSC.2024.3362274

Lin, Chuan-Tung; Huang, Paul Xuanyuanliang; Oh, Jonghyun; Wang, Dewei; Seok, Mingoo (January 2024, IEEE Journal of Solid-State Circuits)

Full Text Available
iMCU: A 102-μJ, 61-ms Digital In-Memory Computingbased Microcontroller Unit for Edge TinyML

https://doi.org/10.1109/CICC57935.2023.10121221

Lin, Chuan-Tung; Huang, Paul Xuanyuanliang; Oh, Jonghyun; Wang, Dewei; Seok, Mingoo (April 2023, 2023 IEEE Custom Integrated Circuits Conference (CICC))

Full Text Available
INTIACC: A 32-bit Floating-Point Programmable Custom-ISA Accelerator for Solving Classes of Partial Differential Equations

https://doi.org/10.1109/ESSCIRC55480.2022.9911441

Huang, Paul Xuanyuanliang; Jang, Daniel; Tsividis, Yannis; Seok, Mingoo (September 2022, ESSCIRC 2022- IEEE 48th European Solid State Circuits Conference (ESSCIRC))

We propose a numerical integration accelerator (INTIACC) that speeds up the solution of partial differential equations (PDEs) for scientific computing. In contrast to recent works, INTIACC applies to a variety of PDEs and boundary conditions, has enhanced nonlinear function capability, supports high-order integration algorithms, and uses floating-point arithmetic for orders of magnitude smaller solution error. With all the benefits, our test chip still achieves 40X speed-up over prior accelerators and orders of magnitudes over CPU and GPU based systems.
more » « less
Full Text Available
A 4.2-to-0.5-V, 0.8-μA–0.8-mA, Power-Efficient Three-Level SIMO Buck Converter for a Quad-Voltage RISC-V Microprocessor

https://doi.org/10.1109/TVLSI.2024.3477632

Kim, Dongkwun; Wang, Zhaoqing; Huang, Paul Xuanyuanliang; Chundi, Pavan Kumar; Kim, Suhwan; Blanco, Andrés A; Krishnamurthy, Ram K; Seok, Mingoo (January 2025, IEEE Transactions on Very Large Scale Integration (VLSI) Systems)

Full Text Available

Search for: All records