NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Hybrid Timestamping Using Crystal and RC Oscillators for Shock-Resistant Precision

https://doi.org/10.1109/TVLSI.2025.3544410

Hamed, Ehab A; Carichner, Gordy; Green, Delbert A; Kim, Hun-Seok; Lee, Inhee (July 2025, IEEE Transactions on Very Large Scale Integration (VLSI) Systems)

Free, publicly-accessible full text available July 1, 2026
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference

Lee, Changwoo; Kwon, Soo Min; Qu, Qing; Kim, Hun-Seok (December 2024, Advances in Neural Information Processing Systems)

Large-scale foundation models have demonstrated exceptional performance in language and vision tasks. However, the numerous dense matrix-vector operations involved in these large networks pose significant computational challenges during inference. To address these challenges, we introduce the Block-Level Adaptive STructured (BLAST) matrix, designed to learn and leverage efficient structures prevalent in the weight matrices of linear layers within deep learning models. Compared to existing structured matrices, the BLAST matrix offers substantial flexibility, as it can represent various types of structures that are either learned from data or computed from pre-existing weight matrices. We demonstrate the efficiency of using the BLAST matrix for compressing both language and vision tasks, showing that (i) for medium-sized models such as ViT and GPT-2, training with BLAST weights boosts performance while reducing complexity by 70% and 40%, respectively; and (ii) for large foundation models such as Llama-7B and DiT-XL, the BLAST matrix achieves a 2x compression while exhibiting the lowest performance degradation among all tested structured matrices. Our code is available at https://github.com/changwoolee/BLAST.
Free, publicly-accessible full text available December 6, 2025
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference

Lee, Changwoo; Kwon, Soo Min; Qu, Qing; Kim, Hun-Seok (December 2024, Advances in Neural Information Processing Systems)

Free, publicly-accessible full text available December 5, 2025
A Crystal-Less Frequency-Modulation Transmitter IC with Joint Neural-Network-Driven Modulation and Coding for Low-Power Connectivity

https://doi.org/10.1109/ISSCC49661.2025.10904502

Shen, Yi; Chang, Boxuan; Tseng, Chien-Wei; Wang, Yunfan; Zhang, Qirui; Fan, Zichen; Feng, Zhen; Narashimha, Rahul; Bejarano-Carbo, Andrea; Kim, Hun-Seok; et al (February 2025, Digest of technical papers)

Free, publicly-accessible full text available February 16, 2026
Canalis: A Throughput-Optimized Framework for Real-Time Stream Processing of Wireless Communication

https://doi.org/10.1145/3695880

Chen, Kuan-Yu; Mason Nelson, Thomas; Khadem, Alireza; Fayazi, Morteza; Singapuram, Sanjay Sri Vallabh; Dreslinski, Ronald; Talati, Nishil; Kim, Hun-Seok; Blaauw, David (December 2024, ACM Transactions on Reconfigurable Technology and Systems)

Stream processing, which involves real-time computation of data as it is created or received, is vital for various applications, specifically wireless communication. The evolving protocols, the requirement for high-throughput, and the challenges of handling diverse processing patterns make it demanding. Traditional platforms grapple with meeting real-time throughput and latency requirements due to large data volume, sequential and indeterministic data arrival, and variable data rates, leading to inefficiencies in memory access and parallel processing. We present Canalis, a throughput-optimized framework designed to address these challenges, ensuring high-performance while achieving low energy consumption. Canalis is a hardware-software co-designed system. It includes a programmable spatial architecture, Flux Stream Processing Unit (FluxSPU), proposed by this work to enhance data throughput and energy efficiency. FluxSPU is accompanied by a software stack that eases the programming process. We evaluated Canalis with eight distinct benchmarks. When compared to CPU and GPU in mobile SoC to demonstrate the effectiveness of domain specialization, Canalis achieves an average speedup of 13.4× and 6.6×, and energy savings of 189.8× and 283.9×, respectively. In contrast to equivalent ASICs of the benchmarks, the average energy overhead of Canalis is within 2.4×, successfully maintaining generalizations without incurring significant overhead.
Free, publicly-accessible full text available December 31, 2025
ParaBase: A Configurable Parallel Baseband Processor for Ultra-High-Speed Inter-Satellite Optical Communications

https://doi.org/10.1145/3665314.3673174

Choi, Seungkyu; Deng, Huanshihong; Chen, Kuan-Yu; Yue, Yufan; Blaauw, David; Kim, Hun-Seok (August 2024, ACM)

Full Text Available
DAP: A 507-GMACs/J 256-Core Domain Adaptive Processor for Wireless Communication and Linear Algebra Kernels in 12-nm FINFET

https://doi.org/10.1109/JSSC.2024.3438758

Chen, Kuan-Yu; Yang, Chi-Sheng; Sun, Yu-Hsiu; Tseng, Chien-Wei; Fayazi, Morteza; He, Xin; Feng, Siying; Yue, Yufan; Mudge, Trevor; Dreslinski, Ronald; et al (August 2024, IEEE Journal of Solid-State Circuits)

Full Text Available
A Low-Power Highly Reconfigurable Analog FIR Filter With 11-Bit Charge-Domain DAC for Narrowband Receivers

https://doi.org/10.1109/LSSC.2024.3361380

Tseng, Chien-Wei; Feng, Zhen; Fan, Zichen; An, Hyochan; Wang, Yunfan; Kim, Hun-Seok; Blaauw, David (January 2024, IEEE Solid-State Circuits Letters)

Full Text Available
Learning-Based Near-Orthogonal Superposition Code for MIMO Short Message Transmission

https://doi.org/10.1109/TCOMM.2023.3274158

Bian, Chenghong; Hsu, Chin-Wei; Lee, Changwoo; Kim, Hun-Seok (September 2023, IEEE Transactions on Communications)

Full Text Available
Instantaneous Feedback-Based Opportunistic Symbol Length Adaptation for Reliable Communication

https://doi.org/10.1109/TCOMM.2023.3266356

Hsu, Chin-Wei; Anastasopoulos, Achilleas; Kim, Hun-Seok (July 2023, IEEE Transactions on Communications)

Full Text Available
