Differentially Private Simple Linear Regression

Alabi, Daniel; McMillan, Audra; Sarathy, Jayshree; Smith, Adam; Vadhan, Salil

doi:10.2478/popets-2022-0041

Cited by 15 publications

(33 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Crucially, in contrast to the majority of prior works on regression, samples 𝑋 𝑖 are indeed unbounded, as they are sampled from 𝒩 (𝜇, Σ). Finally, the boundedness of the outputs 𝑦 𝑖 , 𝑖 ∈ [𝑛], is a requirement we share with other works (e.g., Alabi et al (2020); Wang (2018); Kifer et al (2012); Zhang et al (2012)), and clearly applies to, e.g., binary classification; we also study unbounded labels in the Linear Regression setting.…”

Section: Problem Formulationmentioning

confidence: 99%

“…Linear regression is of course a true workhorse of statistics, and there has been a significant body of work on the design of computationally and statistically efficient differentially private regression algorithms (see e.g., the recent surveys of Cai et al (2020); Wang (2018) and the references therein). Approaches include objective perturbation (Iyengar et al, 2019;Kifer et al, 2012;Zhang et al, 2012;Chaudhuri et al, 2011), output perturbation (Asi and Duchi, 2020;Iyengar et al, 2019;Zhang et al, 2017;Jain and Thakurta, 2014), gradient perturbation (Abadi et al, 2016;Bassily et al, 2014), subsample-and-aggregate (Barrientos et al, 2019;Dwork and Smith, 2010), and sufficient statistics perturbation (Alabi et al, 2020;Wang, 2018;McSherry and Mironov, 2009). Additionally, several works study generalizations of such mechanisms to Generalized Linear Models (GLMs) (Kulkarni et al, 2021;Iyengar et al, 2019;Jain and Thakurta, 2014;Kifer et al, 2012).…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Differentially Private Regression with Unbounded Covariates

Milionis¹,

Alkis²,

Fotakis³

et al. 2022

Preprint

View full text Add to dashboard Cite

We provide computationally efficient, differentially private algorithms for the classical regression settings of Least Squares Fitting, Binary Regression and Linear Regression with unbounded covariates. Prior to our work, privacy constraints in such regression settings were studied under strong a priori bounds on covariates. We consider the case of Gaussian marginals and extend recent differentially private techniques on mean and covariance estimation (Kamath et al., 2019;Karwa and Vadhan, 2018) to the sub-gaussian regime. We provide a novel technical analysis yielding differentially private algorithms for the above classical regression settings. Through the case of Binary Regression, we capture the fundamental and widely-studied models of logistic regression and linearly-separable SVMs, learning an unbiased estimate of the true regression vector, up to a scaling factor.

show abstract

Section: Problem Formulationmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Differentially Private Regression with Unbounded Covariates

Milionis¹,

Alkis²,

Fotakis³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…However, finding a differentially private estimator for this task that is accurate across a range of datasets and parameter regimes is surprisingly nuanced. There has been a significant amount of prior work on differentially private point estimators for the median [Nissim et al, 2007, Bun and Steinke, 2019, Asi and Duchi, 2020, Alabi et al, 2020, Tzamos et al, 2020 and other quantiles [Gillenwater et al, 2021]. To the best of our knowledge, none of these works addressed DP confidence intervals for the median.…”

Section: Related Workmentioning

confidence: 99%

“…Our first private mechanism is an instantiation of the exponential mechanism [McSherry and Talwar, 2007], a differentially private algorithm designed for general optimization problems. The exponential mechanism has been used in prior work to give DP point estimates for the median [Dwork and Lei, 2009, Thakurta and Smith, 2013, Johnson and Shmatikov, 2013, Alabi et al, 2020, Asi and Duchi, 2020. Our extension to providing confidence intervals for the median, while using similar ideas to prior work, requires a careful coverage analysis that is new to this work.…”

Section: Confidence Intervals Based On Exponential Mechanism Expmechmentioning

confidence: 99%

See 1 more Smart Citation

Non-parametric Differentially Private Confidence Intervals for the Median

Drechsler,

Globus-Harris,

McMillan

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Differential privacy is a restriction on data processing algorithms that provides strong confidentiality guarantees for individual records in the data. However, research on proper statistical inference, that is, research on properly quantifying the uncertainty of the (noisy) sample estimate regarding the true value in the population, is currently still limited. This paper proposes and evaluates several strategies to compute valid differentially private confidence intervals for the median. Instead of computing a differentially private point estimate and deriving its uncertainty, we directly estimate the interval bounds and discuss why this approach is superior if ensuring privacy is important. We also illustrate that addressing both sources of uncertainty-the error from sampling and the error from protecting the output-simultaneously should be preferred over simpler approaches that incorporate the uncertainty in a sequential fashion. We evaluate the performance of the different algorithms under various parameter settings in extensive simulation studies and demonstrate how the findings could be applied in practical settings using data from the 1940 Decennial Census.

show abstract

Simulation-based, Finite-sample Inference for Privatized Data

Awan,

Wang

2024

Journal of the American Statistical Association

View full text Add to dashboard Cite

Differentially Private Simple Linear Regression

Cited by 15 publications

References 25 publications

Differentially Private Regression with Unbounded Covariates

Differentially Private Regression with Unbounded Covariates

Non-parametric Differentially Private Confidence Intervals for the Median

Simulation-based, Finite-sample Inference for Privatized Data

Contact Info

Product

Resources

About