NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Exploring the Potential for Generative AI-based Conversational Cues for Real-Time Collaborative Ideation

https://doi.org/10.1145/3635636.3656184

Rayan, Jude; Kanetkar, Dhruv; Gong, Yifan; Yang, Yuewen; Palani, Srishti; Xia, Haijun; Dow, Steven P (June 2024, ACM)

Full Text Available
Search for Efficient Large Language Models

https://doi.org/10.52202/079017-4421

Gong, Yifan; Kong, Zhenglun; Lin, Ming; Lin, Xue; Shen, Xuan; Wang, Yanzhi; Wu, Chao; Wu, Yushu; Zhan, Zheng; Zhao, Pu (January 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Full Text Available
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference

https://doi.org/10.52202/079017-0437

Gong, Yifan; Kong, Zhenglun; Meng, Zichong; Niu, Wei; Wang, Yanzhi; Wu, Yushu; Yang, Changdi; Yuan, Geng; Zhan, Zheng; Zhao, Pu (January 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Full Text Available
MOC: Multi-Objective Mobile CPU-GPU Co-Optimization for Power-Efficient DNN Inference

https://doi.org/10.1109/ICCAD57390.2023.10323882

Wu, Yushu; Gong, Yifan; Zhan, Zheng; Yuan, Geng; Li, Yanyu; Wang, Qi; Wu, Chao; Wang, Yanzhi (October 2023, IEEE)

Full Text Available
Exploring Token Pruning in Vision State Space Models

https://doi.org/10.52202/079017-1613

Gong, Yifan; Ioannidis, Stratis; Kong, Zhenglun; Meng, Zichong; Niu, Wei; Shen, Xuan; Wang, Yanzhi; Wu, Yushu; Zhan, Zheng; Zhao, Pu; et al (January 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Full Text Available
DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Shao, Yucai; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (July 2023, International Conference on Machine Learning (ICML))

Full Text Available
All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management

https://doi.org/10.1145/3508352.3549379

Gong, Yifan; Zhan, Zheng; Zhao, Pu; Wu, Yushu; Wu, Chao; Ding, Caiwen; Jiang, Weiwen; Qin, Minghai; Wang, Yanzhi (October 2022, Design Automation Conference (DAC))

Full Text Available
SparCL: Sparse Continual Learning on the Edge

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Yuan, Geng; Niu, Wei; Jian, Tong; Ren, Bin; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (December 2022, 2022 Conference on Neural Information Processing Systems)

Existing work in continual learning (CL) focuses on mitigating catastrophic forgetting, i.e., model performance deterioration on past tasks when learning a new task. However, the training efficiency of a CL system is under-investigated, which limits the real-world application of CL systems under resource-limited scenarios. In this work, we propose a novel framework called Sparse Continual Learning(SparCL), which is the first study that leverages sparsity to enable cost-effective continual learning on edge devices. SparCL achieves both training acceleration and accuracy preservation through the synergy of three aspects: weight sparsity, data efficiency, and gradient sparsity. Specifically, we propose task-aware dynamic masking (TDM) to learn a sparse network throughout the entire CL process, dynamic data removal (DDR) to remove less informative training data, and dynamic gradient masking (DGM) to sparsify the gradient updates. Each of them not only improves efficiency, but also further mitigates catastrophic forgetting. SparCL consistently improves the training efficiency of existing state-of-the-art (SOTA) CL methods by at most 23X less training FLOPs, and, surprisingly, further improves the SOTA accuracy by at most 1.7%. SparCL also outperforms competitive baselines obtained from adapting SOTA sparse training methods to the CL setting in both efficiency and accuracy. We also evaluate the effectiveness of SparCL on a real mobile phone, further indicating the practical potential of our method.
more » « less
Full Text Available
SparCL: Sparse Continual Learning on the Edge

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Yuan, Geng; Niu, Wei; Jian, Tong; Ren, Bin; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (November 2022, Neural Information Processing Systems (NeurIPS))

Full Text Available
Compiler-aware neural architecture search for on-mobile real-time super-resolution

Wu, Yushu; Gong, Yifan; Zhao, Pu; Li, Yanyu; Zhan, Zheng; Niu, Wei; Tang, Hao; Qin, Minghai; Ren, Bin; Wang, Yanzhi (November 2022, European Conference on Computer Vision (ECCV))

Full Text Available

« Prev Next »

Search for: All records