Improved Differentially Private Analysis of Variance

Swanberg, Marika; Globus-Harris, Ira; Griffith, Iris; Ritz, Anna; Groce, Adam; Bray, Andrew

doi:10.2478/popets-2019-0049

Citation Details

Abstract: Hypothesis testing is one of the most common types of data analysis and forms the backbone of scientific research in many disciplines. Analysis of variance (ANOVA) in particular is used to detect dependence between a categorical and a numerical variable. Here we show how one can carry out this hypothesis test under the restrictions of differential privacy. We show that the F-statistic, the optimal test statistic in the public setting, is no longer optimal in the private setting, and we develop a new test statistic F1 with much higher statistical power. We show how to rigorously compute a reference distribution for the F1 statistic and give an algorithm that outputs accurate p-values. We implement our test and experimentally optimize several parameters. We then compare our test to the only previous work on private ANOVA testing, using the same effect size as that work. We see an order of magnitude improvement, with our test requiring only 7% as much data to detect the effect.
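
For orientation, the classical F-statistic that the abstract calls optimal in the public setting can be sketched as follows. This is only the standard, non-private one-way ANOVA statistic from textbooks; it is not the paper's F1 statistic or its private algorithm, neither of which is defined in this record. The function name `f_statistic` and the list-of-groups input layout are assumptions made for illustration.

```python
from typing import Sequence


def f_statistic(groups: Sequence[Sequence[float]]) -> float:
    """Classical (non-private) one-way ANOVA F-statistic.

    `groups` holds one sequence of numerical observations per category.
    Illustrative sketch only; not the F1 statistic from the paper.
    """
    k = len(groups)                      # number of categories
    n = sum(len(g) for g in groups)      # total number of observations
    if k < 2 or n <= k or any(len(g) == 0 for g in groups):
        raise ValueError("need at least 2 non-empty groups and n > k")

    grand_mean = sum(sum(g) for g in groups) / n
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group variation: how far each group mean sits from the grand mean.
    ssa = sum(len(g) * (m - grand_mean) ** 2
              for g, m in zip(groups, group_means))
    # Within-group variation: how far each observation sits from its group mean.
    sse = sum((y - m) ** 2
              for g, m in zip(groups, group_means) for y in g)
    if sse == 0:
        raise ValueError("no within-group variation; F is undefined")

    # Ratio of the two mean squares.
    return (ssa / (k - 1)) / (sse / (n - k))
```

A large value indicates that the group means differ by more than the within-group spread would explain, which is the dependence between a categorical and a numerical variable that ANOVA is used to detect.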

Award ID(s): 1817245

PAR ID: 10107364

Author(s) / Creator(s): Swanberg, Marika; Globus-Harris, Ira; Griffith, Iris; Ritz, Anna; Groce, Adam; Bray, Andrew

Date Published: 2019-07-01

Journal Name: Proceedings on Privacy Enhancing Technologies

Volume: 2019

Issue: 3

ISSN: 2299-0984

Page Range / eLocation ID: 310 to 330

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.2478/popets-2019-0049
