NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Differentially Private Confidence Intervals for Empirical Risk Minimization

https://doi.org/10.29012/jpc.660

Wang, Yue; Kifer, Daniel; Lee, Jaewoo (March 2019, Journal of Privacy and Confidentiality)

The process of data mining with differential privacy produces results that are affected by two types of noise: sampling noise due to data collection and privacy noise that is designed to prevent the reconstruction of sensitive information. In this paper, we consider the problem of designing confidence intervals for the parameters of a variety of differentially private machine learning models. The algorithms can provide confidence intervals that satisfy differential privacy (as well as the more recently proposed concentrated differential privacy) and can be used with existing differentially private mechanisms that train models using objective perturbation and output perturbation.
more » « less
Full Text Available
A Simple Baseline for Travel Time Estimation using Large-scale Trip Data

https://doi.org/10.1145/3293317

Wang, Hongjian; Tang, Xianfeng; Kuo, Yu-Hsuan; Kifer, Daniel; Li, Zhenhui (January 2019, ACM Transactions on Intelligent Systems and Technology)

Full Text Available
Statistical Approximating Distributions Under Differential Privacy

https://doi.org/10.29012/jpc.666

Wang, Yue; Kifer, Daniel; Lee, Jaewoo; Karwa, Vishesh (December 2018, Journal of Privacy and Confidentiality)

Statistics computed from data are viewed as random variables. When they are used for tasks like hypothesis testing and confidence intervals, their true finite sample distributions are often replaced by approximating distributions that are easier to work with (for example, the Gaussian, which results from using approximations justified by the Central Limit Theorem). When data are perturbed by differential privacy, the approximating distributions also need to be modified. Prior work provided various competing methods for creating such approximating distributions with little formal justification beyond the fact that they worked well empirically. In this paper, we study the question of how to generate statistical approximating distributions for differentially private statistics, provide finite sample guarantees for the quality of the approximations.
more » « less
Full Text Available
Detecting Violations of Differential Privacy

https://doi.org/10.1145/3243734.3243818

Ding, Zeyu; Wang, Yuxin; Wang, Guanhong; Zhang, Danfeng; Kifer, Daniel (October 2018, The 2018 ACM SIGSAC Conference on Computer and Communications Security)

Full Text Available
Concentrated Differentially Private Gradient Descent with Adaptive per-Iteration Privacy Budget

https://doi.org/10.1145/3219819.3220076

Lee, Jaewoo; Kifer, Daniel (August 2018, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining)

Iterative algorithms, like gradient descent, are common tools for solving a variety of problems, such as model fitting. For this reason, there is interest in creating differentially private versions of them. However, their conversion to differentially private algorithms is often naive. For instance, a fixed number of iterations are chosen, the privacy budget is split evenly among them, and at each iteration, parameters are updated with a noisy gradient. In this paper, we show that gradient-based algorithms can be improved by a more careful allocation of privacy budget per iteration. Intuitively, at the beginning of the optimization, gradients are expected to be large, so that they do not need to be measured as accurately. However, as the parameters approach their optimal values, the gradients decrease and hence need to be measured more accurately. We add a basic line-search capability that helps the algorithm decide when more accurate gradient measurements are necessary. Our gradient descent algorithm works with the recently introduced zCDP version of differential privacy. It outperforms prior algorithms for model fitting and is competitive with the state-of-the-art for $(ε,δ)$-differential privacy, a strictly weaker definition than zCDP.
more » « less
Full Text Available

Search for: All records