Winning the NIST Contest: A scalable and general approach to differentially private synthetic data

McKenna, Ryan; Miklau, Gerome; Sheldon, Daniel

doi:10.29012/jpc.778

Citation Details

Winning the NIST Contest: A scalable and general approach to differentially private synthetic data

We propose a general approach for differentially private synthetic data generation, that consists of three steps: (1) select a collection of low-dimensional marginals, (2) measure those marginals with a noise addition mechanism, and (3) generate synthetic data that preserves the measured marginals well. Central to this approach is Private-PGM, a post-processing method that is used to estimate a high-dimensional data distribution from noisy measurements of its marginals. We present two mechanisms, NIST-MST and MST, that are instances of this general approach. NIST-MST was the winning mechanism in the 2018 NIST differential privacy synthetic data competition, and MST is a new mechanism that can work in more general settings, while still performing comparably to NIST-MST. We believe our general approach should be of broad interest, and can be adopted in future mechanisms for synthetic data generation. more »

Award ID(s):: 1749854

PAR ID:: 10359643

Author(s) / Creator(s):: McKenna, Ryan; Miklau, Gerome; Sheldon, Daniel

Date Published:: 2021-12-24

Journal Name:: Journal of Privacy and Confidentiality

Volume:: 11

Issue:: 3

ISSN:: 2575-8527

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.29012/jpc.778

More Like this