NSF PAR Search | NSF Public Access Repository

Privacy in Metalearning and Multitask Learning: Modeling and Separations

Aliakbarpour, Maryam; Bairaktari, Konstantina; Smith, Adam; Swanberg, Marika; Ullman, Jonathan (May 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available May 1, 2026
A Bias-Accuracy-Privacy Trilemma for Statistical Estimation

https://doi.org/10.1080/01621459.2024.2443275

Kamath, Gautam; Mouzakis, Argyris; Regehr, Matthew; Singhal, Vikrant; Steinke, Thomas; Ullman, Jonathan (February 2025, Journal of the American Statistical Association)

Free, publicly-accessible full text available February 10, 2026
Private Geometric Median

Haghifam, Mahdi; Steinke, Thomas; Ullman, Jonathan (December 2024, Neural Information Processing Systems)

Free, publicly-accessible full text available December 10, 2025
Private Mean Estimation with Person-Level Differential Privacy

https://doi.org/10.1137/1.9781611978322.92

Agarwal, Sushant; Kamath, Gautam; Majid, Mahbod; Mouzakis, Argyris; Silver, Rose; Ullman, Jonathan (January 2025, Society for Industrial and Applied Mathematics)

Free, publicly-accessible full text available January 1, 2026
How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization

Lowy, Andrew; Ullman, Jonathan; Wright, Stephen (July 2024, ICML 2024)

Full Text Available
Smooth Lower Bounds for Differentially Private Algorithms via Padding-and-Permuting Fingerprinting Codes

Peter, Naty; Tsfadia, Eliad; Ullman, Jonathan (June 2024, Proceedings of Machine Learning Research)

Full Text Available
TMI! Finetuned Models Leak Private Information from their Pretraining Data

https://doi.org/10.56553/popets-2024-0075

Abascal, John; Wu, Stanley; Oprea, Alina; Ullman, Jonathan (July 2024, Proceedings on Privacy Enhancing Technologies)

Transfer learning has become an increasingly popular technique in machine learning as a way to leverage a pretrained model trained for one task to assist with building a finetuned model for a related task. This paradigm has been especially popular for privacy in machine learning, where the pretrained model is considered public, and only the data for finetuning is considered sensitive. However, there are reasons to believe that the data used for pretraining is still sensitive, making it essential to understand how much information the finetuned model leaks about the pretraining data. In this work we propose a new membership-inference threat model where the adversary only has access to the finetuned model and would like to infer the membership of the pretraining data. To realize this threat model, we implement a novel metaclassifier-based attack, TMI, that leverages the influence of memorized pretraining samples on predictions in the downstream task. We evaluate TMI on both vision and natural language tasks across multiple transfer learning settings, including finetuning with differential privacy. Through our evaluation, we find that TMI can successfully infer membership of pretraining examples using query access to the finetuned model.
Full Text Available
Program Analysis for Adaptive Data Analysis

https://doi.org/10.1145/3656414

Liu, Jiawen; Qu, Weihao; Gaboardi, Marco; Garg, Deepak; Ullman, Jonathan (June 2024, Proceedings of the ACM on Programming Languages)

Data analyses are usually designed to identify some property of the population from which the data are drawn, generalizing beyond the specific data sample. For this reason, data analyses are often designed in a way that guarantees that they produce a low generalization error. That is, they are designed so that the result of a data analysis run on a data sample does not differ too much from the result one would achieve by running the analysis over the entire population. An adaptive data analysis can be seen as a process composed of multiple queries interrogating some data, where the choice of which query to run next may rely on the results of previous queries. The generalization error of each individual query/analysis can be controlled by using an array of well-established statistical techniques. However, when queries are arbitrarily composed, the different errors can propagate through the chain of different queries and lead to a high generalization error. To address this issue, data analysts are designing several techniques that not only guarantee bounds on the generalization errors of single queries, but also guarantee bounds on the generalization error of the composed analyses. The choice of which of these techniques to use often depends on the chain of queries that an adaptive data analysis can generate. In this work, we consider adaptive data analyses implemented as while-like programs and we design a program analysis which can help with identifying which technique to use to control their generalization errors. More specifically, we formalize the intuitive notion of adaptivity as a quantitative property of programs. We do this because the adaptivity level of a data analysis is a key measure for choosing the right technique. Based on this definition, we design a program analysis for soundly approximating this quantity. The program analysis generates a representation of the data analysis as a weighted dependency graph, where the weight is an upper bound on the number of times each variable can be reached, and uses a path search strategy to guarantee an upper bound on the adaptivity. We implement our program analysis and show that it can help to analyze the adaptivity of several concrete data analyses with different adaptivity structures.
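As a rough illustration of the idea in this abstract, the sketch below bounds adaptivity by the heaviest path through a weighted dependency graph. It is a simplification and not the paper's algorithm: it assumes the graph is acyclic, and the function name `adaptivity_upper_bound`, the `is_query` flag, and the example variables are all hypothetical.

```python
from functools import lru_cache


def adaptivity_upper_bound(edges, weight, is_query):
    """Heaviest path in an acyclic dependency graph, counting only query nodes.

    edges:    list of (src, dst) pairs, meaning dst depends on src
    weight:   dict node -> upper bound on how often the node is reached
    is_query: dict node -> True if the node is assigned a query result
    Assumption (not from the source): the graph has no cycles.
    """
    successors = {node: [] for node in weight}
    for src, dst in edges:
        successors[src].append(dst)

    @lru_cache(maxsize=None)
    def best_from(node):
        # A node contributes its weight only when it holds a query result.
        own = weight[node] if is_query[node] else 0
        return own + max((best_from(n) for n in successors[node]), default=0)

    return max((best_from(node) for node in weight), default=0)


# Hypothetical analysis: q1 feeds a plain variable a, a feeds q2,
# and q2 feeds q3, which sits in a loop executed at most 3 times.
weight = {"q1": 1, "a": 1, "q2": 1, "q3": 3}
is_query = {"q1": True, "a": False, "q2": True, "q3": True}
edges = [("q1", "a"), ("a", "q2"), ("q2", "q3")]

chained_bound = adaptivity_upper_bound(edges, weight, is_query)

# Two queries that do not depend on each other form chains of length one.
independent_bound = adaptivity_upper_bound(
    [], {"x": 1, "y": 1}, {"x": True, "y": True}
)
```

In this toy setting the chained analysis gets a bound of 5 (one round each for `q1` and `q2`, plus up to three for `q3`), while the two independent queries get a bound of 1. A real analysis of while-like programs would also have to handle cyclic dependencies, which this sketch deliberately leaves out.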
Full Text Available
Metalearning with Very Few Samples Per Task

Aliakbarpour, Maryam; Bairaktari, Konstantina; Brown, Gavin; Smith, Adam; Srebro, Nathan; Ullman, Jonathan (June 2024, Proceedings of Machine Learning Research)

Full Text Available
Metalearning with Very Few Samples Per Task

Aliakbarpour, Maryam; Bairaktari, Konstantina; Brown, Gavin; Smith, Adam; Srebro, Nathan; Ullman, Jonathan (June 2024, Conference on Learning Theory)
