Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2013
DOI: 10.1145/2487575.2487687
|View full text |Cite
|
Sign up to set email alerts
|

Privacy-preserving data exploration in genome-wide association studies

Abstract: Genome-wide association studies (GWAS) have become a popular method for analyzing sets of DNA sequences in order to discover the genetic basis of disease. Unfortunately, statistics published as the result of GWAS can be used to identify individuals participating in the study. To prevent privacy breaches, even previously published results have been removed from public databases, impeding researchers’ access to the data and hindering collaborative research. Existing techniques for privacy-preserving GWAS focus o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
195
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 160 publications
(196 citation statements)
references
References 37 publications
1
195
0
Order By: Relevance
“…To assess the potential gain in utility of our relaxation, we focus on a particular application of DP, by re-evaluating the privacy protecting mechanisms in genome-wide association studies [10,19,22] for the release of SNPs with high χ 2 -statistics. Our results show that, for a bounded adversarial model, we require up to 2500 fewer patients in the study, in order to reach an acceptable tradeoff between privacy and medical utility.…”
Section: Results Assessment and Implicationsmentioning
confidence: 99%
See 1 more Smart Citation
“…To assess the potential gain in utility of our relaxation, we focus on a particular application of DP, by re-evaluating the privacy protecting mechanisms in genome-wide association studies [10,19,22] for the release of SNPs with high χ 2 -statistics. Our results show that, for a bounded adversarial model, we require up to 2500 fewer patients in the study, in order to reach an acceptable tradeoff between privacy and medical utility.…”
Section: Results Assessment and Implicationsmentioning
confidence: 99%
“…For instance, Fredrikson et al [6] investigate personalized warfarin dosing and demonstrate that for privacy budgets effective against a certain type of inference attacks, satisfying DP exposes patients to highly increased mortality risks. Similarly, studies on privacy in genome-wide association studies (GWAS) [10,19,22] consider differential privacy as a protective measure against an inference attack discovered by Homer et al [9,20]. These works show that for reasonably small values of , the medical utility is essentially null under DP, unless there is an access to impractically large patient datasets.…”
Section: Introductionmentioning
confidence: 99%
“…p m is a non-uniform sequence distribution and we assume γ ≤ 3 − √ 5 ≈ 0.76, which is a requirement for Corollary 1 (in [13]). This assumption is reasonable considering the length of the sequence n (≥ 20000) 5 . To estimate γ, we can consider the sequence with all major alleles and pessimistically assume each major allele frequency is 0.995, large enough to give an upper bound for real datasets.…”
Section: Proofmentioning
confidence: 99%
“…As a response to the above privacy breach in published genomic statistics, Fienberg et al [4] propose to apply Laplacian noise to the released data to achieve differential privacy. Another approach to achieving differential privacy in genome-wide association study is proposed by Johnson and Shmatikov [5]. Yu et al [6] present scalable privacy-preserving methods in genome-wide association studies based on Laplace mechanism and exponential mechanism.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation