Differentially Private Inference for Binomial Data

We derive uniformly most powerful (UMP) tests for simple and one-sided hypotheses for a population proportion within the framework of Differential Privacy (DP), optimizing finite sample performance. We show that in general, DP hypothesis tests can be written in terms of linear constraints, and for e...

Full description

Bibliographic Details
Main Authors:	Jordan Alexander Awan, Aleksandra Slavkovic
Format:	Article
Language:	English
Published:	Labor Dynamics Institute 2020-01-01
Series:	The Journal of Privacy and Confidentiality
Subjects:	Bernoulli Hypothesis Test Confidence interval Frequentist Statistical disclosure control Neyman-Pearson
Online Access:	https://journalprivacyconfidentiality.org/index.php/jpc/article/view/725

_version_	1818527922989826048
author	Jordan Alexander Awan Aleksandra Slavkovic
author_facet	Jordan Alexander Awan Aleksandra Slavkovic
author_sort	Jordan Alexander Awan
collection	DOAJ
description	We derive uniformly most powerful (UMP) tests for simple and one-sided hypotheses for a population proportion within the framework of Differential Privacy (DP), optimizing finite sample performance. We show that in general, DP hypothesis tests can be written in terms of linear constraints, and for exchangeable data can always be expressed as a function of the empirical distribution. Using this structure, we prove a `Neyman-Pearson lemma' for binomial data under DP, where the DP-UMP only depends on the sample sum. Our tests can also be stated as a post-processing of a random variable, whose distribution we coin ``Truncated-Uniform-Laplace'' (Tulap), a generalization of the Staircase and discrete Laplace distributions. Furthermore, we obtain exact p-values, which are easily computed in terms of the Tulap random variable. Using the above techniques, we show that our tests can be applied to give uniformly most accurate one-sided confidence intervals and optimal confidence distributions. We also derive uniformly most powerful unbiased (UMPU) two-sided tests, which lead to uniformly most accurate unbiased (UMAU) two-sided confidence intervals. We show that our results can be applied to distribution-free hypothesis tests for continuous data. Our simulation results demonstrate that all our tests have exact type I error, and are more powerful than current techniques.
first_indexed	2024-12-11T06:42:46Z
format	Article
id	doaj.art-c91415d4f64740f59d8e79e855a18cce
institution	Directory Open Access Journal
issn	2575-8527
language	English
last_indexed	2024-12-11T06:42:46Z
publishDate	2020-01-01
publisher	Labor Dynamics Institute
record_format	Article
series	The Journal of Privacy and Confidentiality
spelling	doaj.art-c91415d4f64740f59d8e79e855a18cce2022-12-22T01:17:10ZengLabor Dynamics InstituteThe Journal of Privacy and Confidentiality2575-85272020-01-0110110.29012/jpc.725Differentially Private Inference for Binomial DataJordan Alexander Awan0Aleksandra Slavkovic1Pennsylvania State UniversityPennsylvania State UniversityWe derive uniformly most powerful (UMP) tests for simple and one-sided hypotheses for a population proportion within the framework of Differential Privacy (DP), optimizing finite sample performance. We show that in general, DP hypothesis tests can be written in terms of linear constraints, and for exchangeable data can always be expressed as a function of the empirical distribution. Using this structure, we prove a `Neyman-Pearson lemma' for binomial data under DP, where the DP-UMP only depends on the sample sum. Our tests can also be stated as a post-processing of a random variable, whose distribution we coin ``Truncated-Uniform-Laplace'' (Tulap), a generalization of the Staircase and discrete Laplace distributions. Furthermore, we obtain exact p-values, which are easily computed in terms of the Tulap random variable. Using the above techniques, we show that our tests can be applied to give uniformly most accurate one-sided confidence intervals and optimal confidence distributions. We also derive uniformly most powerful unbiased (UMPU) two-sided tests, which lead to uniformly most accurate unbiased (UMAU) two-sided confidence intervals. We show that our results can be applied to distribution-free hypothesis tests for continuous data. Our simulation results demonstrate that all our tests have exact type I error, and are more powerful than current techniques.https://journalprivacyconfidentiality.org/index.php/jpc/article/view/725BernoulliHypothesis TestConfidence intervalFrequentistStatistical disclosure controlNeyman-Pearson
spellingShingle	Jordan Alexander Awan Aleksandra Slavkovic Differentially Private Inference for Binomial Data The Journal of Privacy and Confidentiality Bernoulli Hypothesis Test Confidence interval Frequentist Statistical disclosure control Neyman-Pearson
title	Differentially Private Inference for Binomial Data
title_full	Differentially Private Inference for Binomial Data
title_fullStr	Differentially Private Inference for Binomial Data
title_full_unstemmed	Differentially Private Inference for Binomial Data
title_short	Differentially Private Inference for Binomial Data
title_sort	differentially private inference for binomial data
topic	Bernoulli Hypothesis Test Confidence interval Frequentist Statistical disclosure control Neyman-Pearson
url	https://journalprivacyconfidentiality.org/index.php/jpc/article/view/725
work_keys_str_mv	AT jordanalexanderawan differentiallyprivateinferenceforbinomialdata AT aleksandraslavkovic differentiallyprivateinferenceforbinomialdata

Differentially Private Inference for Binomial Data

Similar Items