NSF PAR Search | NSF Public Access Repository


Safe and Robust Watermark Injection with a Single OoD Image

Yu, Shuyang; Hong, Junyuan; Zhang, Haobo; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (January 2024, 2024 International Conference on Learning Representations)

Full Text Available
Revisiting Data-Free Knowledge Distillation with Poisoned Teachers

Hong, Junyuan; Zeng, Yi; Yu, Shuyang; Lyu, Lingjuan; Jia, Ruoxi; Zhou, Jiayu (July 2023, Proceedings of the 40th International Conference on Machine Learning)

Data-free knowledge distillation (KD) transfers knowledge from a pre-trained model (the teacher) to a smaller model (the student) without access to the original data used to train the teacher. However, the security of the synthetic or out-of-distribution (OOD) data required in data-free KD is largely unknown and under-explored. In this work, we make the first effort to uncover the security risk of data-free KD w.r.t. untrusted pre-trained models. We then propose Anti-Backdoor Data-Free KD (ABD), the first plug-in defensive method for data-free KD methods that reduces the chance of potential backdoors being transferred. We empirically evaluate the effectiveness of ABD in diminishing transferred backdoor knowledge while maintaining downstream performance comparable to vanilla KD. We envision this work as a milestone for raising awareness of and mitigating potential backdoors in data-free KD. Code is released at https://github.com/illidanlab/ABD .
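For readers unfamiliar with the teacher-student setup the abstract refers to, the following is a minimal sketch of the generic temperature-softened distillation loss. It is not the ABD defense and is not taken from the linked repository; the function names and the default temperature are illustrative assumptions. In a data-free setting, the logits would be computed on synthetic or OOD inputs rather than on the original training data.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The temperature**2 factor is the usual rescaling that keeps gradient
    magnitudes comparable across temperatures (an assumption borrowed from
    standard KD practice, not from the abstract above).
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl
```

The loss is zero when the student reproduces the teacher's logits exactly and grows as the two softened distributions diverge, which is why anything encoded in the teacher's outputs, including a backdoor, can carry over to the student.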
Full Text Available
Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection

Yu, Shuyang; Hong, Junyuan; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (April 2023, International Conference on Learning Representations (ICLR))

Deep neural networks have achieved great success in many challenging prediction tasks, yet they often suffer from out-of-distribution (OoD) samples, misclassifying them with high confidence. Recent advances show promising OoD detection performance for centralized training; however, OoD detection in federated learning (FL) is largely overlooked, even though many security-sensitive applications such as autonomous driving and voice recognition authorization are commonly trained using FL because of data privacy concerns. The main challenge that prevents previous state-of-the-art OoD detection methods from being incorporated into FL is that they require a large number of real OoD samples. In real-world scenarios, such large-scale OoD training data can be costly or even infeasible to obtain, especially for resource-limited local devices. On the other hand, a notorious challenge in FL is data heterogeneity, where each client collects non-independent and identically distributed (non-iid) data. We propose to take advantage of such heterogeneity and turn the curse into a blessing that facilitates OoD detection in FL. The key is that, for each client, non-iid data from other clients (unseen external classes) can serve as an alternative to real OoD samples. Specifically, we propose a novel Federated Out-of-Distribution Synthesizer (FOSTER), which learns a class-conditional generator to synthesize virtual external-class OoD samples while maintaining the data confidentiality and communication efficiency required by FL. Experimental results show that our method outperforms the state of the art by 2.49%, 2.88%, and 1.42% AUROC, and by 0.01%, 0.89%, and 1.74% ID accuracy, on CIFAR-10, CIFAR-100, and STL10, respectively.
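The AUROC figures quoted above measure how well a detector's score separates in-distribution (ID) inputs from OoD inputs. As a minimal sketch of that metric, assuming the common maximum-softmax-probability baseline as the confidence score (this is not FOSTER, and all names below are hypothetical):

```python
import math


def max_softmax_score(logits):
    """Confidence score: the largest softmax probability (higher = more ID-like)."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    return max(exps) / sum(exps)


def auroc(id_scores, ood_scores):
    """Probability that a random ID sample scores higher than a random OoD sample.

    Ties count as half a win. This pairwise form is O(n * m) and is meant
    for illustration on small inputs only.
    """
    wins = 0.0
    for s_id in id_scores:
        for s_ood in ood_scores:
            if s_id > s_ood:
                wins += 1.0
            elif s_id == s_ood:
                wins += 0.5
    return wins / (len(id_scores) * len(ood_scores))
```

An AUROC of 1.0 means every ID sample receives a higher score than every OoD sample, while 0.5 corresponds to a score that carries no information about which is which.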
Full Text Available
Safe and Robust Watermark Injection with a Single OoD Image

Yu, Shuyang; Hong, Junyuan; Zhang, Haobo; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (May 2023, International Conference on Learning Representations (ICLR) 2023)

Full Text Available
Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection

Yu, Shuyang; Hong, Junyuan; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (February 2023, 2023 International Conference on Learning Representations)

Full Text Available
Robust Unsupervised Domain Adaptation from A Corrupted Source

https://doi.org/10.1109/ICDM54844.2022.00171

Yu, Shuyang; Zhu, Zhuangdi; Liu, Boyang; Jain, Anil K.; Zhou, Jiayu (November 2022, 2022 IEEE International Conference on Data Mining (ICDM))

Full Text Available
Federated Adversarial Debiasing for Fair and Transferable Representations

https://doi.org/10.1145/3447548.3467281

Hong, Junyuan; Zhu, Zhuangdi; Yu, Shuyang; Wang, Zhangyang; Dodge, Hiroko H.; Zhou, Jiayu (August 2021, Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining)

Full Text Available
Tackle Balancing Constraint for Incremental Semi-Supervised Support Vector Learning

Yu, Shuyang; Gu, Bin; Ning, Kunpeng; Chen, Haiyan; Pei, Jian; Huang, Heng (January 2019, 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2019))

Full Text Available
