NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Translingual Language Markers for Cognitive Assessment from Spontaneous Speech

https://doi.org/10.21437/Interspeech.2024-1422

Hoang, Bao; Pang, Yijiang; Dodge, Hiroko; Zhou, Jiayu (September 2024, 25th Interspeech Conference)

Full Text Available
Distributed Harmonization: Federated Clustered Batch Effect Adjustment and Generalization

https://doi.org/10.1145/3637528.3671590

Hoang, Bao; Pang, Yijiang; Liang, Siqi; Zhan, Liang; Thompson, Paul M; Zhou, Jiayu (August 2024, ACM)

Full Text Available
On the Generalization Ability of Unsupervised Pretraining

Deng, Yuyang; Hong, Junyuan; Zhou, Jiayu; Mahdavi, Mehrdad (March 2024, International Conference on Artificial Intelligence and Statistics)

Recent advances in unsupervised learning have shown that unsupervised pre-training, followed by fine-tuning, can improve model generalization. However, a rigorous understanding of how the representation function learned on an unlabeled dataset affects the generalization of the fine-tuned model is lacking. Existing theoretical research does not adequately account for the heterogeneity of the distribution and tasks in pre-training and fine-tuning stage. To bridge this gap, this paper introduces a novel theoretical framework that illuminates the critical factor influencing the transferability of knowledge acquired during unsupervised pre-training to the subsequent fine-tuning phase, ultimately affecting the generalization capabilities of the fine-tuned model on downstream tasks. We apply our theoretical framework to analyze generalization bound of two distinct scenarios: Context Encoder pre-training with deep neural networks and Masked Autoencoder pre-training with deep transformers, followed by fine-tuning on a binary classification task. Finally, inspired by our findings, we propose a novel regularization method during pre-training to further enhances the generalization of fine-tuned model. Overall, our results contribute to a better understanding of unsupervised pre-training and fine-tuning paradigm, and can shed light on the design of more effective pre-training algorithms.
more » « less
Full Text Available
Safe and Robust Watermark Injection with a Single OoD Image

Yu, Shuyang; Hong, Junyuan; Zhang, Haobo; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (January 2024, 2024 International Conference on Learning Representations)

Full Text Available
Subject Harmonization of Digital Biomarkers: Improved Detection of Mild Cognitive Impairment from Language Markers

https://doi.org/10.1142/9789811286421_0015

Hoang, Bao; Pang, Yijiang; Dodge, Hiroko H; Zhou, Jiayu (December 2023, WORLD SCIENTIFIC)

Full Text Available
Understanding Deep Gradient Leakage via Inversion Influence Functions

Zhang, Haobo; Hong, Junyuan; Deng, Yuyang; Mahdavi, Mehrdad; Zhou, Jiayu (September 2023, 2023 Conference on Neural Information Processing Systems)

Full Text Available
Revisiting Data-Free Knowledge Distillation with Poisoned Teachers

Hong, Junyuan; Zeng, Yi; Yu, Shuyang; Lyu, Lingjuan; Jia, Ruoxi; Zhou, Jiayu (July 2023, Proceedings of the 40th International Conference on Machine Learning)

Data-free knowledge distillation (KD) helps transfer knowledge from a pre-trained model (known as the teacher model) to a smaller model (known as the student model) without access to the original training data used for training the teacher model. However, the security of the synthetic or out-of-distribution (OOD) data required in data-free KD is largely unknown and under-explored. In this work, we make the first effort to uncover the security risk of data-free KD w.r.t. untrusted pre-trained models. We then propose Anti-Backdoor Data-Free KD (ABD), the first plug-in defensive method for data-free KD methods to mitigate the chance of potential backdoors being transferred. We empirically evaluate the effectiveness of our proposed ABD in diminishing transferred backdoor knowledge while maintaining compatible downstream performances as the vanilla KD. We envision this work as a milestone for alarming and mitigating the potential backdoors in data-free KD. Codes are released at https://github.com/illidanlab/ABD .
more » « less
Full Text Available
Federated Robustness Propagation: Sharing Adversarial Robustness in Heterogeneous Federated Learning

https://doi.org/10.1609/aaai.v37i7.25955

Hong, Junyuan; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Federated learning (FL) emerges as a popular distributed learning schema that learns a model from a set of participating users without sharing raw data. One major challenge of FL comes with heterogeneous users, who may have distributionally different (or non-iid) data and varying computation resources. As federated users would use the model for prediction, they often demand the trained model to be robust against malicious attackers at test time. Whereas adversarial training (AT) provides a sound solution for centralized learning, extending its usage for federated users has imposed significant challenges, as many users may have very limited training data and tight computational budgets, to afford the data-hungry and costly AT. In this paper, we study a novel FL strategy: propagating adversarial robustness from rich-resource users that can afford AT, to those with poor resources that cannot afford it, during federated learning. We show that existing FL techniques cannot be effectively integrated with the strategy to propagate robustness among non-iid users and propose an efficient propagation approach by the proper use of batch-normalization. We demonstrate the rationality and effectiveness of our method through extensive experiments. Especially, the proposed method is shown to grant federated models remarkable robustness even when only a small portion of users afford AT during learning. Source code can be accessed at https://github.com/illidanlab/FedRBN.
more » « less
Full Text Available
How Robust is Your Fairness? Evaluating and Sustaining Fairness under Unseen Distribution Shifts

Wang, Haotao; Hong, Junyuan; Zhou, Jiayu; Wang, Zhangyang (March 2023, Transactions on machine learning research)
Niu, Gang (Ed.)
Increasing concerns have been raised on deep learning fairness in recent years. Existing fairness-aware machine learning methods mainly focus on the fairness of in-distribution data. However, in real-world applications, it is common to have distribution shift between the training and test data. In this paper, we first show that the fairness achieved by existing methods can be easily broken by slight distribution shifts. To solve this problem, we propose a novel fairness learning method termed CUrvature MAtching (CUMA), which can achieve robust fairness generalizable to unseen domains with unknown distributional shifts. Specifically, CUMA enforces the model to have similar generalization ability on the majority and minority groups, by matching the loss curvature distributions of the two groups. We evaluate our method on three popular fairness datasets. Compared with existing methods, CUMA achieves superior fairness under unseen distribution shifts, without sacrificing either the overall accuracy or the in-distribution fairness.
more » « less
Full Text Available
Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection

Yu, Shuyang; Hong, Junyuan; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (February 2023, 2023 International Conference on Learning Representations)

Deep neural networks have witnessed huge successes in many challenging prediction tasks and yet they often suffer from out-of-distribution (OoD) samples, misclassifying them with high confidence. Recent advances show promising OoD detection performance for centralized training, and however, OoD detection in federated learning (FL) is largely overlooked, even though many security sensitive applications such as autonomous driving and voice recognition authorization are commonly trained using FL for data privacy concerns. The main challenge that prevents previous state-of-the-art OoD detection methods from being incorporated to FL is that they require large amount of real OoD samples. However, in real-world scenarios, such large-scale OoD training data can be costly or even infeasible to obtain, especially for resource-limited local devices. On the other hand, a notorious challenge in FL is data heterogeneity where each client collects non-identically and independently distributed (non-iid) data. We propose to take advantage of such heterogeneity and turn the curse into a blessing that facilitates OoD detection in FL. The key is that for each client, non-iid data from other clients (unseen external classes) can serve as an alternative to real OoD samples. Specifically, we propose a novel Federated Out-of-Distribution Synthesizer (FOSTER), which learns a class-conditional generator to synthesize virtual external-class OoD samples, and maintains data confidentiality and communication efficiency required by FL. Experimental results show that our method outperforms the state-of-the-art by 2.49%, 2.88%, 1.42% AUROC, and 0.01%, 0.89%, 1.74% ID accuracy, on CIFAR-10, CIFAR-100, and STL10, respectively.
more » « less
Full Text Available

« Prev Next »

Search for: All records