NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimal Hypothesis Selection in (Almost) Linear Time

Aliakbarpour, Maryam; Bun, Mark; Smith, Adam (December 2024, NeurIPS 2024)

Full Text Available
Oracle-Efficient Differentially Private Learning with Public Data

Block, Adam; Bun, Mark; Desai, Rathin; Shetty, Abhishek; Wu, Zhiwei Steven (December 2024, NeurIPS 2024)

Full Text Available
Continual Release of Differentially Private Synthetic Data from Longitudinal Data Collections

https://doi.org/10.1145/3651595

Bun, Mark; Gaboardi, Marco; Neunhoeffer, Marcel; Zhang, Wanrong (May 2024, Proceedings of the ACM on Management of Data)

Motivated by privacy concerns in long-term longitudinal studies in medical and social science research, we study the problem of continually releasing differentially private synthetic data from longitudinal data collections. We introduce a model where, in every time step, each individual reports a new data element, and the goal of the synthesizer is to incrementally update a synthetic dataset in a consistent way to capture a rich class of statistical properties. We give continual synthetic data generation algorithms that preserve two basic types of queries: fixed time window queries and cumulative time queries. We show nearly tight upper bounds on the error rates of these algorithms and demonstrate their empirical performance on realistically sized datasets from the U.S. Census Bureau's Survey of Income and Program Participation.
more » « less
Full Text Available
Private PAC Learning May be Harder than Online Learning

Bun, Mark; Cohen, Aloni; Desai, Rathin (March 2024, International Conference on Algorithmic Learning Theory)

Full Text Available
Not All Learnable Distribution Classes are Privately Learnable

Bun, Mark; Kamath, Gautam; Mouzakis, Argyris; Singhal, Vikrant (March 2024, International Conference on Algorithmic Learning Theory)

Full Text Available
Hypothesis Selection with Memory Constraints

Aliakbarpour, Maryam; Bun, Mark; Smith, Adam (February 2024, Neural Information Processing Systems)

Full Text Available
Differentially private confidence intervals for proportions under stratified random sampling

https://doi.org/10.1214/24-EJS2234

Lin, Shurong; Bun, Mark; Gaboardi, Marco; Kolaczyk, Eric D; Smith, Adam (January 2024, Electronic Journal of Statistics)

Full Text Available
Approximate Degree Lower Bounds for Oracle Identification Problems

Bun, Mark; Voronova, Nadezhda (July 2023, 18th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2023))
Fawzi, Omar; Walter, Michael (Ed.)
The approximate degree of a Boolean function is the minimum degree of real polynomial that approximates it pointwise. For any Boolean function, its approximate degree serves as a lower bound on its quantum query complexity, and generically lifts to a quantum communication lower bound for a related function. We introduce a framework for proving approximate degree lower bounds for certain oracle identification problems, where the goal is to recover a hidden binary string x ∈ {0, 1}ⁿ given possibly non-standard oracle access to it. Our lower bounds apply to decision versions of these problems, where the goal is to compute the parity of x. We apply our framework to the ordered search and hidden string problems, proving nearly tight approximate degree lower bounds of Ω(n/log² n) for each. These lower bounds generalize to the weakly unbounded error setting, giving a new quantum query lower bound for the hidden string problem in this regime. Our lower bounds are driven by randomized communication upper bounds for the greater-than and equality functions.
more » « less
Full Text Available
Stability Is Stable: Connections between Replicability, Privacy, and Adaptive Generalization

https://doi.org/10.1145/3564246.3585246

Bun, Mark; Gaboardi, Marco; Hopkins, Max; Impagliazzo, Russell; Lei, Rex; Pitassi, Toniann; Sivakumar, Satchit; Sorrell, Jessica (June 2023, STOC 2023: Proceedings of the 55th Annual ACM Symposium on Theory of Computing)

The notion of replicable algorithms was introduced by Impagliazzo, Lei, Pitassi, and Sorrell (STOC’22) to describe randomized algorithms that are stable under the resampling of their inputs. More precisely, a replicable algorithm gives the same output with high probability when its randomness is fixed and it is run on a new i.i.d. sample drawn from the same distribution. Using replicable algorithms for data analysis can facilitate the verification of published results by ensuring that the results of an analysis will be the same with high probability, even when that analysis is performed on a new data set. In this work, we establish new connections and separations between replicability and standard notions of algorithmic stability. In particular, we give sample-efficient algorithmic reductions between perfect generalization, approximate differential privacy, and replicability for a broad class of statistical problems. Conversely, we show any such equivalence must break down computationally: there exist statistical problems that are easy under differential privacy, but that cannot be solved replicably without breaking public-key cryptography. Furthermore, these results are tight: our reductions are statistically optimal, and we show that any computational separation between DP and replicability must imply the existence of one-way functions. Our statistical reductions give a new algorithmic framework for translating between notions of stability, which we instantiate to answer several open questions in replicability and privacy. This includes giving sample-efficient replicable algorithms for various PAC learning, distribution estimation, and distribution testing problems, algorithmic amplification of δ in approximate DP, conversions from item-level to user-level privacy, and the existence of private agnostic-to-realizable learning reductions under structured distributions.
more » « less
Full Text Available
Stability is Stable: Connections between Replicability, Privacy and Adaptive Generalization

Bun, Mark; Gaboardi, Marco; Hopkins, Max; Impagliazzo, Russell; Lei, Rex; Pitassi, Toniann; Sivakumar, Satchit; Sorrell, Jessica (April 2023, Proceedings of the annual ACM Symposium on Theory of Computing)

Full Text Available

« Prev Next »

Search for: All records