NSF PAR Search | NSF Public Access Repository

DP-BREM: Differentially-Private and Byzantine-Robust Federated Learning with Client Momentum

Gu, Xiaolan; Li, Ming; Xiong, Li (August 2025, USENIX Security Symposium)

Free, publicly-accessible full text available August 1, 2026

TransFed: cross-domain feature alignment for semi-supervised federated transfer learning

https://doi.org/10.1007/s10994-025-06805-1

Zeng, Linghui; Liu, Ruixuan; Xiong, Li; Ho, Joyce C (August 2025, Machine Learning)

Free, publicly-accessible full text available August 1, 2026

Contrastive Unlearning: A Contrastive Approach to Machine Unlearning

https://doi.org/10.24963/ijcai.2025/830

Lee, Hong kyu; Zhang, Qiuchen; Yang, Carl; Lou, Jian; Xiong, Li (September 2025, International Joint Conferences on Artificial Intelligence Organization)

Machine unlearning aims to eliminate the influence of a subset of training samples (i.e., unlearning samples) from a trained model. Effectively and efficiently removing the unlearning samples without negatively impacting the overall model performance is challenging. Existing works mainly exploit input and output space and classification loss, which can result in ineffective unlearning or performance loss. In addition, they utilize unlearning or remaining samples ineffectively, sacrificing either unlearning efficacy or efficiency. Our main insight is that direct optimization on the representation space, utilizing both unlearning and remaining samples, can effectively remove the influence of unlearning samples while maintaining representations learned from remaining samples. We propose a contrastive unlearning framework, leveraging the concept of representation learning for more effective unlearning. It removes the influence of unlearning samples by contrasting their embeddings against the remaining samples' embeddings so that their embeddings are closer to the embeddings of unseen samples. Experiments on a variety of datasets and models on both class unlearning and sample unlearning showed that contrastive unlearning achieves the best unlearning effects and efficiency with the lowest performance loss compared with the state-of-the-art algorithms. In addition, it is generalizable to different contrastive frameworks and other models such as vision-language models. Our main code is available at github.com/Emory-AIMS/Contrastive-Unlearning.
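The abstract above does not spell out the loss, so the following is only a minimal sketch of the general idea, assuming an InfoNCE-style objective in which an unlearning sample's embedding is pulled toward remaining samples of other classes (treated as positives) and pushed away from remaining samples of its own class (treated as negatives). The function names, the choice of positives and negatives, and the temperature value are illustrative assumptions, not the authors' implementation; the linked repository holds the actual code.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_unlearning_loss(anchor, positives, negatives, temperature=0.5):
    """InfoNCE-style loss for one unlearning-sample embedding (hypothetical).

    anchor:    embedding of the sample to be unlearned
    positives: embeddings of remaining samples from OTHER classes (assumption)
    negatives: embeddings of remaining samples from the SAME class (assumption)
    Minimizing the loss moves the anchor away from its original class cluster.
    """
    pos = [math.exp(cosine(anchor, p) / temperature) for p in positives]
    neg = [math.exp(cosine(anchor, n) / temperature) for n in negatives]
    return -math.log(sum(pos) / (sum(pos) + sum(neg)))


# Toy 2-D example: the same-class cluster lies along [1, 0],
# the other-class cluster lies along [0, 1].
same_class = [[1.0, 0.0]]
other_class = [[0.0, 1.0]]
loss_before = contrastive_unlearning_loss([1.0, 0.0], other_class, same_class)
loss_after = contrastive_unlearning_loss([0.0, 1.0], other_class, same_class)
```

In this toy setup the loss is higher while the anchor still sits in its original class cluster (`loss_before`) than after it has moved toward the other cluster (`loss_after`), which is the direction a gradient step on the encoder would push it.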
Free, publicly-accessible full text available September 1, 2026

Unraveling Complex Temporal Patterns in EHRs via Robust Irregular Tensor Factorization

Ren, Yifei; Zeng, Linghui; Lou, Jian; Xiong, Li; Ho, Joyce; Jiang, Xiaoqian; Bhavani, Sivasubramanium (June 2025, AMIA Jt Summits Transl Sci Proc)

Free, publicly-accessible full text available June 10, 2026

Privacy and Accuracy-Aware AI/ML Model Deduplication

https://doi.org/10.1145/3725340

Guan, Hong; Yu, Lei; Zhou, Lixi; Xiong, Li; Chowdhury, Kanchan; Xie, Lulu; Xiao, Xusheng; Zou, Jia (June 2025, Proceedings of the ACM on Management of Data)

With the growing adoption of privacy-preserving machine learning algorithms, such as Differentially Private Stochastic Gradient Descent (DP-SGD), training or fine-tuning models on private datasets has become increasingly prevalent. This shift has led to the need for models offering varying privacy guarantees and utility levels to satisfy diverse user requirements. Managing numerous versions of large models introduces significant operational challenges, including increased inference latency, higher resource consumption, and elevated costs. Model deduplication is a technique widely used by many model serving and database systems to support high-performance and low-cost inference queries and model diagnosis queries. However, none of the existing model deduplication works has considered privacy, leading to unbounded aggregation of privacy costs for certain deduplicated models and inefficiencies when applied to deduplicate DP-trained models. We formalize the problem of deduplicating DP-trained models for the first time and propose a novel privacy- and accuracy-aware deduplication mechanism to address the problem. We developed a greedy strategy to select and assign base models to target models to minimize storage and privacy costs. When deduplicating a target model, we dynamically schedule accuracy validations and apply the Sparse Vector Technique to reduce the privacy costs associated with private validation data. Compared to baselines, our approach improved the compression ratio by up to 35× for individual models (including large language models and vision transformers). We also observed up to 43× inference speedup due to the reduction of I/O operations.
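The Sparse Vector Technique mentioned above can be illustrated with its textbook form, AboveThreshold: a stream of bounded-sensitivity queries is compared against a noisy threshold, and only the first query to cross it consumes the privacy budget. The sketch below is the generic algorithm, not the paper's validation scheduler; the function names, the accuracy-drop framing, and all numbers in the usage lines are assumptions made for illustration.

```python
import random


def laplace(scale, rng):
    """Sample Laplace(0, scale) as the difference of two i.i.d. exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def above_threshold(query_values, threshold, epsilon, sensitivity=1.0, rng=None):
    """Return the index of the first query whose noisy value reaches the
    noisy threshold, or None if no query does.

    Uses the standard AboveThreshold noise scales: 2*sensitivity/epsilon
    for the threshold and 4*sensitivity/epsilon for each query.
    """
    rng = rng or random.Random()
    noisy_threshold = threshold + laplace(2.0 * sensitivity / epsilon, rng)
    for index, value in enumerate(query_values):
        noisy_value = value + laplace(4.0 * sensitivity / epsilon, rng)
        if noisy_value >= noisy_threshold:
            return index
    return None


# Hypothetical usage: accuracy drops measured on a private validation set of
# 1,000 examples after successive deduplication steps. Changing one example
# changes an accuracy value by at most 1/1000, so that is the sensitivity.
accuracy_drops = [0.001, 0.004, 0.012, 0.031]
first_violation = above_threshold(
    accuracy_drops, threshold=0.02, epsilon=1.0, sensitivity=1.0 / 1000
)
```

Because the result is randomized, `first_violation` can differ between runs; a caller would typically stop deduplicating at the returned step, or continue if the result is `None`.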
Free, publicly-accessible full text available June 17, 2026

Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training

https://doi.org/10.18653/v1/2025.findings-acl.1174

Tran, Toan; Liu, Ruixuan; Xiong, Li (January 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available January 1, 2026

Simulated Infectious Diseases Datasets with Controlled Data Bias

https://doi.org/10.1145/3711896.3737401

Kong, Ruochen; Anderson, Taylor; Scotch, Matthew; Heslop, David J; Khaokaew, Yonchanok; Xue, Hao; Xiong, Li; MacIntyre, Chandini Raina; Salim, Flora D; Züfle, Andreas (August 2025, ACM)

Free, publicly-accessible full text available August 3, 2026

PreCurious: How Innocent Pre-Trained Language Models Turn into Privacy Traps

https://doi.org/10.1145/3658644.3690279

Liu, Ruixuan; Wang, Tianhao; Cao, Yang; Xiong, Li (December 2024, ACM)

Free, publicly-accessible full text available December 2, 2025

Cross-silo Federated Learning with Record-level Personalized Differential Privacy

https://doi.org/10.1145/3658644.3670351

Liu, Junxu; Lou, Jian; Xiong, Li; Liu, Jinfei; Meng, Xiaofeng (December 2024, ACM)

Free, publicly-accessible full text available December 2, 2025

Federated Node Classification over Distributed Ego-Networks with Secure Contrastive Embedding Sharing

https://doi.org/10.1145/3627673.3679834

Xie, Han; Xiong, Li; Yang, Carl (October 2024, ACM)

Full Text Available
