2019
DOI: 10.48550/arxiv.1902.04495
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The Cost of Privacy: Optimal Rates of Convergence for Parameter Estimation with Differential Privacy

Abstract: Privacy-preserving data analysis is a rising challenge in contemporary statistics, as the privacy guarantees of statistical methods are often achieved at the expense of accuracy. In this paper, we investigate the tradeoff between statistical accuracy and privacy in mean estimation and linear regression, under both the classical lowdimensional and modern high-dimensional settings. A primary focus is to establish minimax optimality for statistical estimation with the (ε, δ)-differential privacy constraint. To th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
47
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 28 publications
(47 citation statements)
references
References 37 publications
0
47
0
Order By: Relevance
“…Even for DP linear regression without robustness, HPTR is the first algorithm for sub-Gaussian distributions with an unknown covariance Σ that up to log factors matches the lower bound of n = Ω(d/α 2 + d/(αε)) assuming ε < 1 and δ < n −1−ω for some ω > 0 from [CWZ19, Theorem 4.1]. For completeness, we provide the lower bound in Appendix C. An existing algorithm for DP linear regression from [CWZ19] is suboptimal as it require Σ to be close to the identity matrix, which is equivalent to assuming that we know Σ.…”
Section: Hypercontractive Distributions With Independent Noisementioning
confidence: 94%
See 1 more Smart Citation
“…Even for DP linear regression without robustness, HPTR is the first algorithm for sub-Gaussian distributions with an unknown covariance Σ that up to log factors matches the lower bound of n = Ω(d/α 2 + d/(αε)) assuming ε < 1 and δ < n −1−ω for some ω > 0 from [CWZ19, Theorem 4.1]. For completeness, we provide the lower bound in Appendix C. An existing algorithm for DP linear regression from [CWZ19] is suboptimal as it require Σ to be close to the identity matrix, which is equivalent to assuming that we know Σ.…”
Section: Hypercontractive Distributions With Independent Noisementioning
confidence: 94%
“…There exists a (ε, δ)-differentially private algorithm β(S) that given HPTR is the first algorithm for sub-Gaussian distributions with an unknown covariance Σ that up to logarithmic factors matches the lower bound of n = Ω(d/ξ 2 + d/(ξε)) assuming ε < 1 and δ < n −1−ω for some ω > 0 from [CWZ19, Theorem 4.1]. An existing algorithm for DP linear regression from [CWZ19] is suboptimal as it require Σ to be close to the identity matrix, which is equivalent to assuming that we know Σ. [DL09] proposes to use PTR and B-robust regression algorithm from [HRRS86] for differentially private linear regression under i.i.d.…”
Section: Dp Linear Regressionmentioning
confidence: 99%
“…The trade-off between statistical accuracy and privacy is one of the fundamental topics in differential privacy. In the low-dimensional setting, there are various works focusing on this trade-off, including mean estimation [28,52,8,34,10,36], confidence intervals of Gaussian mean [37] and binomial mean [4], linear regression [55,10], generalized linear models [51,11,50], principal component analysis [30,12], convex empirical risk minimization [7], and robust M-estimators [2,3].…”
Section: Introductionmentioning
confidence: 99%
“…Most of the existing works focus on relatively standard statistical problems such as the sparse mean estimation and regression. For example, [10] studies near-optimal algorithms for the sparse mean estimation.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation