NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Stochastic Privacy-Preserving Methods for Nonconvex Sparse Learning

https://doi.org/10.1016/j.ins.2022.09.062

Liang, Guannan; Tong, Qianqian; Ding, Jiahao; Pan, Miao; Bi, Jinbo (October 2022, Information Sciences)

Full Text Available
Federated Optimization of ℓ0-norm Regularized Sparse Learning

https://doi.org/10.3390/a15090319

Tong, Qianqian; Liang, Guannan; Ding, Jiahao; Zhu, Tan; Pan, Miao; Bi, Jinbo (September 2022, Algorithms)

Regularized sparse learning with the ℓ0-norm is important in many areas, including statistical learning and signal processing. Iterative hard thresholding (IHT) methods are the state-of-the-art for nonconvex-constrained sparse learning due to their capability of recovering true support and scalability with large datasets. The current theoretical analysis of IHT assumes the use of centralized IID data. In realistic large-scale scenarios, however, data are distributed, seldom IID, and private to edge computing devices at the local level. Consequently, it is required to study the property of IHT in a federated environment, where local devices update the sparse model individually and communicate with a central server for aggregation infrequently without sharing local data. In this paper, we propose the first group of federated IHT methods: Federated Hard Thresholding (Fed-HT) and Federated Iterative Hard Thresholding (FedIter-HT) with theoretical guarantees. We prove that both algorithms have a linear convergence rate and guarantee for recovering the optimal sparse estimator, which is comparable to classic IHT methods, but with decentralized, non-IID, and unbalanced data. Empirical results demonstrate that the Fed-HT and FedIter-HT outperform their competitor—a distributed IHT, in terms of reducing objective values with fewer communication rounds and bandwidth requirements.
more » « less
Full Text Available
IoT Device Friendly and Communication-Efficient Federated Learning via Joint Model Pruning and Quantization

https://doi.org/10.1109/JIOT.2022.3145865

Prakash, Pavana; Ding, Jiahao; Chen, Rui; Qin, Xiaoqi; Shu, Minglei; Cui, Qimei; Guo, Yuanxiong; Pan, Miao (August 2022, IEEE Internet of Things Journal)

Full Text Available
To Talk or to Work: Delay Efficient Federated Learning over Mobile Edge Devices

https://doi.org/10.1109/GLOBECOM46510.2021.9685793

Prakash, Pavana; Ding, Jiahao; Wu, Maoqiang; Shu, Minglei; Yu, Rong; Pan, Miao (December 2021, IEEE Global Communications Conference 2021)

Full Text Available
Incentivizing Differentially Private Federated Learning: A Multi-Dimensional Contract Approach

https://doi.org/10.1109/JIOT.2021.3050163

Wu, Maoqiang; Ye, Dongdong; Ding, Jiahao; Guo, Yuanxiong; Yu, Rong; Pan, Miao (January 2021, IEEE Internet of Things Journal)
null (Ed.)
Full Text Available
Adaptive Privacy Preserving Deep Learning Algorithms for Medical Data

Zhang, Xinyue; Ding, Jiahao; Wu, Maoqiang; Wong, Stephen TC.; Nguyen, Hien V.; Pan, Miao (January 2021, IEEE Winter Conference on Applications of Computer Vision)
null (Ed.)
Deep learning holds a great promise of revolutionizing healthcare and medicine. Unfortunately, various inference attack models demonstrated that deep learning puts sensitive patient information at risk. The high capacity of deep neural networks is the main reason behind the privacy loss. In particular, patient information in the training data can be unintentionally memorized by a deep network. Adversarial parties can extract that information given the ability to access or query the network. In this paper, we propose a novel privacy-preserving mechanism for training deep neural networks. Our approach adds decaying Gaussian noise to the gradients at every training iteration. This is in contrast to the mainstream approach adopted by Google's TensorFlow Privacy, which employs the same noise scale in each step of the whole training process. Compared to existing methods, our proposed approach provides an explicit closed-form mathematical expression to approximately estimate the privacy loss. It is easy to compute and can be useful when the users would like to decide proper training time, noise scale, and sampling ratio during the planning phase. We provide extensive experimental results using one real-world medical dataset (chest radiographs from the CheXpert dataset) to validate the effectiveness of the proposed approach. The proposed differential privacy based deep learning model achieves significantly higher classification accuracy over the existing methods with the same privacy budget.
more » « less
Full Text Available
Private Empirical Risk Minimization with Analytic Gaussian Mechanism for Healthcare System

https://doi.org/10.1109/TBDATA.2020.2997732

Ding, Jiahao; Errapotu, Sai Mounika; Guo, Yuanxiong; Zhang, Haixia; Yuan, Dongfeng; Pan, Miao (January 2020, IEEE Transactions on Big Data)
null (Ed.)
Full Text Available
Stochastic ADMM Based Distributed Machine Learning with Differential Privacy

https://doi.org/10.1007/978-3-030-37228-6_13

Ding, Jiahao; Errapotu, Sai Mounika; Zhang, Haijun; Gong, Yanmin; Pan, Miao; Han, Zhu (December 2019, Lecture notes of the Institute for Computer Sciences Social Informatics and Telecommunications Engineering)

While embracing various machine learning techniques to make effective decisions in the big data era, preserving the privacy of sensitive data poses significant challenges. In this paper, we develop a privacy-preserving distributed machine learning algorithm to address this issue. Given the assumption that each data provider owns a dataset with different sample size, our goal is to learn a common classifier over the union of all the local datasets in a distributed way without leaking any sensitive information of the data samples. Such an algorithm needs to jointly consider efficient distributed learning and effective privacy preservation. In the proposed algorithm, we extend stochastic alternating direction method of multipliers (ADMM) in a distributed setting to do distributed learning. For preserving privacy during the iterative process, we combine differential privacy and stochastic ADMM together. In particular, we propose a novel stochastic ADMM based privacy-preserving distributed machine learning (PS-ADMM) algorithm by perturbing the updating gradients, that provide differential privacy guarantee and have a low computational cost. We theoretically demonstrate the convergence rate and utility bound of our proposed PS-ADMM under strongly convex objective. Through our experiments performed on real-world datasets, we show that PS-ADMM outperforms other differentially private ADMM algorithms under the same differential privacy guarantee.
more » « less
Full Text Available

Search for: All records