NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Unified lower bounds for interactive high-dimensional estimation under information constraints

Acharya, Jayadev; Canonne, Cl'ement L; Sun, Ziteng; Tyagi, Himanshu (December 2023, Curran Associates, Inc.)
Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S (Ed.)
We consider distributed parameter estimation using interactive protocols subject to local information constraints such as bandwidth limitations, local differential privacy, and restricted measurements. We provide a unified framework enabling us to derive a variety of (tight) minimax lower bounds for different parametric families of distributions, both continuous and discrete, under any Lp loss. Our lower bound framework is versatile and yields “plug-and-play” bounds that are widely applicable to a large range of estimation problems, and, for the prototypical case of the Gaussian family, circumvents limitations of previous techniques. In particular, our approach recovers bounds obtained using data processing inequalities and Cramér–Rao bounds, two other alternative approaches for proving lower bounds in our setting of interest. Further, for the families considered, we complement our lower bounds with matching upper bounds.
more » « less
Full Text Available
User-level Private Stochastic Convex Optimization with Optimal Rates

Bassily, Raef; Sun, Ziteng (July 2023, Proceedings of Machine Learning Research - The 40th International Conference on Machine Learning)

Full Text Available
User-level Private Stochastic Convex Optimization with Optimal Rates

Bassily, Raef; Sun, Ziteng (July 2023, PMLR: Volume 202)

Full Text Available
Discrete distribution estimation under user-level local differential privacy

Acharya, Jayadev; Liu, Yuhan; Sun, Ziteng (April 2023, Proceedings of Machine Learning Research)
Ruiz, Francisco; Dy, Jennifer; van de Meent, Jan-Willem (Ed.)
We study discrete distribution estimation under user-level local differential privacy (LDP). In user-level $$\varepsilon$$-LDP, each user has $$m\ge1$$ samples and the privacy of all $$m$$ samples must be preserved simultaneously. We resolve the following dilemma: While on the one hand having more samples per user should provide more information about the underlying distribution, on the other hand, guaranteeing the privacy of all $$m$$ samples should make the estimation task more difficult. We obtain tight bounds for this problem under almost all parameter regimes. Perhaps surprisingly, we show that in suitable parameter regimes, having $$m$$ samples per user is equivalent to having $$m$$ times more users, each with only one sample. Our results demonstrate interesting phase transitions for $$m$$ and the privacy parameter $$\varepsilon$$ in the estimation risk. Finally, connecting with recent results on shuffled DP, we show that combined with random shuffling, our algorithm leads to optimal error guarantees (up to logarithmic factors) under the central model of user-level DP in certain parameter regimes. We provide several simulations to verify our theoretical findings.
more » « less
Full Text Available
Sample Complexity of Distinguishing Cause from Effect

Acharya, Jayadev; Bhadane, Sourbh; Bhattacharyya, Arnab; Kandasamy, Saravanan; Sun, Ziteng (April 2023, Proceedings of Machine Learning Research)
Ruiz, Francisco; Dy, Jennifer; van de Meent, Jan-Willem (Ed.)
We study the sample complexity of causal structure learning on a two-variable system with observational and experimental data. Specifically, for two variables X and Y, we consider the classical scenario where either X causes Y , Y causes X, or there is an unmeasured confounder between X and Y. We show that if X and Y are over a finite domain of size k and are significantly correlated, the minimum number of interventional samples needed is sublinear in k. We give a tight characterization of the tradeoff between observational and interventional data when the number of observational samples is sufficiently large. We build upon techniques for closeness testing and for non-parametric density estimation in different regimes of observational data. Our hardness results are based on carefully constructing causal models whose marginal and interventional distributions form hard instances of canonical results on property testing.
more » « less
Full Text Available
The Role of Interactivity in Structured Estimation

Acharya, Jayadev; Canonne, Clement L.; Sun, Ziteng; Tyagi, Himanshu (January 2022, Proceedings of Machine Learning Research)
Loh, Po-Ling; Raginsky, Maxim (Ed.)
Full Text Available
Interactive Inference Under Information Constraints

https://doi.org/10.1109/TIT.2021.3123905

Acharya, Jayadev; Canonne, Clement L.; Liu, Yuhan; Sun, Ziteng; Tyagi, Himanshu (January 2022, IEEE Transactions on Information Theory)

Full Text Available
Distributed Estimation with Multiple Samples per User: Sharp Rates and Phase Transition

Acharya, Jayadev; Canonne, Clement; Liu, Yuhan; Sun, Ziteng; Tyagi, Himanshu (December 2021, Advances in neural information processing systems)

We obtain tight minimax rates for the problem of distributed estimation of discrete distributions under communication constraints, where n users observing m samples each can broadcast only ℓ bits. Our main result is a tight characterization (up to logarithmic factors) of the error rate as a function of m, ℓ, the domain size, and the number of users under most regimes of interest.
more » « less
Full Text Available
Robust Testing and Estimation under Manipulation Attacks

Acharya, Jayadev; Sun, Ziteng; Zhang, Huanyu (July 2021, Proceedings of Machine Learning Research)

We study robust testing and estimation of discrete distributions in the strong contamination model. Our results cover both centralized setting and distributed setting with general local information constraints including communication and LDP constraints. Our technique relates the strength of manipulation attacks to the earth-mover distance using Hamming distance as the metric between messages (samples) from the users. In the centralized setting, we provide optimal error bounds for both learning and testing. Our lower bounds under local information constraints build on the recent lower bound methods in distributed inference. In the communication constrained setting, we develop novel algorithms based on random hashing and an L1-L1 isometry.
more » « less
Full Text Available
Estimating Sparse Discrete Distributions Under Privacy and Communication Constraints

Acharya, Jayadev; Kairouz, Peter; Liu, Yuhan; Sun, Ziteng (March 2021, Proceedings of Machine Learning Research)

We consider the problem of estimating sparse discrete distributions under local differential privacy (LDP) and communication constraints. We characterize the sample complexity for sparse estimation under LDP constraints up to a constant factor, and the sample complexity under communication constraints up to a logarithmic factor. Our upper bounds under LDP are based on the Hadamard Response, a private coin scheme that requires only one bit of communication per user. Under communication constraints we propose public coin schemes based on random hashing functions. Our tight lower bounds are based on recently proposed method of chi squared contractions.
more » « less
Full Text Available

« Prev Next »

Search for: All records