Search for: All records

Creators/Authors contains: "Liu, Sijia"

« Prev Next »

Total Resources

45

Resource Type
Conference Paper

36

Conference Proceeding

2

Dataset

0

Journal Article

7

Workshop Report

0

Availability
Full Text / Resource Available

38

Citation Only

7

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Model Sparsity Can Simplify Machine Unlearning

Jia, Jinghan ; Liu, Jiancheng ; Ram, Parikshit ; Yao, Yuguang ; Liu, Gaowen ; Liu, Yang ; Sharma, Pranay ; Liu, Sijia ( December 2023 , The Thirty-eighth Annual Conference on Neural Information Processing Systems)

In response to recent data regulation requirements, machine unlearning (MU) has emerged as a critical process to remove the influence of specific examples from a given model. Although exact unlearning can be achieved through complete model retraining using the remaining dataset, the associated computational costs have driven the development of efficient, approximate unlearning techniques. Moving beyond data-centric MU approaches, our study introduces a novel model-based perspective: model sparsification via weight pruning, which is capable of reducing the gap between exact unlearning and approximate unlearning. We show in both theory and practice that model sparsity can boost the multi-criteria unlearning performance of an approximate unlearner, closing the approximation gap, while continuing to be efficient. This leads to a new MU paradigm, termed prune first, then unlearn, which infuses a sparse model prior into the unlearning process. Building on this insight, we also develop a sparsity-aware unlearning method that utilizes sparsity regularization to enhance the training process of approximate unlearning. Extensive experiments show that our proposals consistently benefit MU in various unlearning scenarios. A notable highlight is the 77% unlearning efficacy gain of fine-tuning (one of the simplest unlearning methods) when using sparsity-aware unlearning. Furthermore, we demonstrate the practical impact of our proposed MU methods in addressing other machine learning challenges, such as defending against backdoor attacks and enhancing transfer learning. Codes are available at this https URL.
more » « less
Free, publicly-accessible full text available December 9, 2024
Designing a Direct Feedback Loop between Humans and Convolutional Neural Networks through Local Explanations

https://doi.org/10.1145/3610187

Sun, Tong Steven ; Gao, Yuyang ; Khaladkar, Shubham ; Liu, Sijia ; Zhao, Liang ; Kim, Young-Ho ; Hong, Sungsoo Ray ( September 2023 , Proceedings of the ACM on Human-Computer Interaction)

The local explanation provides heatmaps on images to explain how Convolutional Neural Networks (CNNs) derive their output. Due to its visual straightforwardness, the method has been one of the most popular explainable AI (XAI) methods for diagnosing CNNs. Through our formative study (S1), however, we captured ML engineers' ambivalent perspective about the local explanation as a valuable and indispensable envision in building CNNs versus the process that exhausts them due to the heuristic nature of detecting vulnerability. Moreover, steering the CNNs based on the vulnerability learned from the diagnosis seemed highly challenging. To mitigate the gap, we designed DeepFuse, the first interactive design that realizes the direct feedback loop between a user and CNNs in diagnosing and revising CNN's vulnerability using local explanations. DeepFuse helps CNN engineers to systemically search unreasonable local explanations and annotate the new boundaries for those identified as unreasonable in a labor-efficient manner. Next, it steers the model based on the given annotation such that the model doesn't introduce similar mistakes. We conducted a two-day study (S2) with 12 experienced CNN engineers. Using DeepFuse, participants made a more accurate and reasonable model than the current state-of-the-art. Also, participants found the way DeepFuse guides case-based reasoning can practically improve their current practice. We provide implications for design that explain how future HCI-driven design can move our practice forward to make XAI-driven insights more actionable.

more » « less
Free, publicly-accessible full text available September 28, 2024
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks

Chowdhury, Mohammed Nowaz ; Zhang, Shuai ; Wang, Meng ; Liu, Sijia ; Chen, Pin-Yu. ( July 2023 , Proc. of 2023 International Conference on Machine Learning (ICML))
Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach

Khanduri, Prashant ; Tsaknakis, Ioannis ; Zhang, Yihua ; Liu, Jia ; Liu, Sijia ; Zhang, Jiawei ; Hong, Mingyi ( July 2023 , Proc. ICML)

Free, publicly-accessible full text available July 23, 2024
Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach

Khanduri, Prashant ; Tsaknakis, Ioannis ; Zhang, Yihua ; Liu, Jia ; Liu, Sijia ; Zhang, Jiawei ; Hong, Mingyi ( July 2023 , Proc. ICML)

Free, publicly-accessible full text available July 23, 2024
A Theoretical Understanding of Vision Transformers: Learning, Generalization, and Sample Complexity

Li Hongkang Li ; Wang, Meng Wang ; Liu, Sijia ; Chen, Pin-Yu. ( May 2023 , Proc. the Eleventh International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available May 1, 2024
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks

Zhang, Shuai ; Wang, Meng ; Chen, Pin-Yu ; Liu, Sijia ; Lu, Songtao ; Liu, Miao. ( May 2023 , Proc. the Eleventh International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available May 1, 2024
Advancing Model Pruning via Bi-level Optimization

Zhang, Yihua ; Yao, Yuguang ; Ram, Parikshit ; Zhao, Pu ; Chen, Tianlong ; Hong, Mingyi ; Wang, Yanzhi ; Liu, Sijia ( December 2022 , 36th Conference on Neural Information Processing Systems (NeurIPS 2022))
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training

Chen, Tianlong ; Zhang, Zhenyu ; Liu, Sijia ; Zhang, Yang ; Chang, Shiyu ; Wang, Zhangyang ( July 2022 , International Conference on Machine Learning)

Full Text Available
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training

Chen, Tianlong ; Zhang, Zhenyu ; Liu, Sijia ; Zhang, Yang ; Chang, Shiyu ; Wang, Zhangyang ( July 2022 , International Conference on Machine Learning (ICML))

Pre-training serves as a broadly adopted starting point for transfer learning on various downstream tasks. Recent investigations of lottery tickets hypothesis (LTH) demonstrate such enormous pre-trained models can be replaced by extremely sparse subnetworks (a.k.a. matching subnetworks) without sacrificing transferability. However, practical security-crucial applications usually pose more challenging requirements beyond standard transfer, which also demand these subnetworks to overcome adversarial vulnerability. In this paper, we formulate a more rigorous concept, Double-Win Lottery Tickets, in which a located subnetwork from a pre-trained model can be independently transferred on diverse downstream tasks, to reach BOTH the same standard and robust generalization, under BOTH standard and adversarial training regimes, as the full pre-trained model can do. We comprehensively examine various pre-training mechanisms and find that robust pre-training tends to craft sparser double-win lottery tickets with superior performance over the standard counterparts. For example, on downstream CIFAR-10/100 datasets, we identify double-win matching subnetworks with the standard, fast adversarial, and adversarial pre-training from ImageNet, at 89.26%/73.79%, 89.26%/79.03%, and 91.41%/83.22% sparsity, respectively. Furthermore, we observe the obtained double-win lottery tickets can be more data-efficient to transfer, under practical data-limited (e.g., 1% and 10%) downstream schemes. Our results show that the benefits from robust pre-training are amplified by the lottery ticket scheme, as well as the data-limited transfer setting.
more » « less
Full Text Available

« Prev Next »