NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PSBD: Prediction shift uncertainty unlocks backdoor detection

Li, Wei; Chen, Pin-Yu; Liu, Sijia; Wang, Ren (July 2025, Proceedings of the Computer Vision and Pattern Recognition Conference)

Free, publicly-accessible full text available July 14, 2026
Rethinking Evaluation Metrics for Machine Unlearning

Shi, Yingdan; Liu, Sijia; Wang, Ren (May 2025, ICML 2025 Workshop MUGen)

Free, publicly-accessible full text available May 1, 2026
SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

https://doi.org/10.18653/v1/2025.acl-long.424

Zhuang, Haomin; Zhang, Yihua; Guo, Kehan; Jia, Jinghan; Liu, Gaowen; Liu, Sijia; Zhang, Xiangliang (July 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available July 1, 2026
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Li, Hongkang; Zhang, Yihua; Zhang, Shuai; Chen, Pin-Yu; Liu, Sijia; Wang, Meng (April 2025, The Thirteenth International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 30, 2026
From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models

Pan, Zhuoshi; Yao, Yuguang; Liu, Gaowen; Shen, Bingquan; Zhao, H Vicky; Kompella, Ramana Rao; Liu, Sijia (December 2024, neurips)

Full Text Available
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models

Zhang, Yimeng; Chen, Xin; Jia, Jinghan; Zhang, Yihua; Fan, Chongyu; Liu, Jiancheng; Hong, Mingyi; Ding, Ke; Liu, Sijia (December 2024, neurips)

Full Text Available
Rethinking machine unlearning for large language models

https://doi.org/10.1038/s42256-025-00985-0

Liu, Sijia; Yao, Yuanshun; Jia, Jinghan; Casper, Stephen; Baracaldo, Nathalie; Hase, Peter; Yao, Yuguang; Liu, Chris Yuhao; Xu, Xiaojun; Li, Hang; et al (February 2025, Nature Machine Intelligence)

Free, publicly-accessible full text available February 1, 2026
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy to Generate Unsafe Images ... For Now

https://doi.org/10.1007/978-3-031-72998-0_22

Zhang, Yimeng; Jia, Jinghan; Chen, Xin; Chen, Aochuan; Zhang, Yihua; Liu, Jiancheng; Ding, Ke; Liu, Sijia (September 2024, Springer Nature Switzerland)

Full Text Available
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency

Pal, Soumyadeep; Yao, Yuguang; Wang, Ren; Shen, Bingquan; Liu, Sijia (May 2024, The Twelfth International Conference on Learning Representations)

Modern machine learning (ML) systems demand substantial training data, often resorting to external sources. Nevertheless, this practice renders them vulnerable to backdoor poisoning attacks. Prior backdoor defense strategies have primarily focused on the identification of backdoored models or poisoned data characteristics, typically operating under the assumption of access to clean data. In this work, we delve into a relatively underexplored challenge: the automatic identification of backdoor data within a poisoned dataset, all under realistic conditions, i.e., without the need for additional clean data or without manually defining a threshold for backdoor detection. We draw an inspiration from the scaled prediction consistency (SPC) technique, which exploits the prediction invariance of poisoned data to an input scaling factor. Based on this, we pose the backdoor data identification problem as a hierarchical data splitting optimization problem, leveraging a novel SPC-based loss function as the primary optimization objective. Our innovation unfolds in several key aspects. First, we revisit the vanilla SPC method, unveiling its limitations in addressing the proposed backdoor identification problem. Subsequently, we develop a bi-level optimization-based approach to precisely identify backdoor data by minimizing the advanced SPC loss. Finally, we demonstrate the efficacy of our proposal against a spectrum of backdoor attacks, encompassing basic label-corrupted attacks as well as more sophisticated clean-label attacks, evaluated across various benchmark datasets. Experiment results show that our approach often surpasses the performance of current baselines in identifying backdoor data points, resulting in about 4%-36% improvement in average AUROC.
more » « less
Full Text Available
An Introduction to Bilevel Optimization: Foundations and applications in signal processing and machine learning

https://doi.org/10.1109/MSP.2024.3358284

Zhang, Yihua; Khanduri, Prashant; Tsaknakis, Ioannis; Yao, Yuguang; Hong, Mingyi; Liu, Sijia (April 2024, IEEE Signal Processing Magazine)

Full Text Available

« Prev Next »

Search for: All records