2023
DOI: 10.1038/s41598-023-34866-6
|View full text |Cite
|
Sign up to set email alerts
|

Large scale proteomic studies create novel privacy considerations

Abstract: Privacy protection is a core principle of genomic but not proteomic research. We identified independent single nucleotide polymorphism (SNP) quantitative trait loci (pQTL) from COPDGene and Jackson Heart Study (JHS), calculated continuous protein level genotype probabilities, and then applied a naïve Bayesian approach to link SomaScan 1.3K proteomes to genomes for 2812 independent subjects from COPDGene, JHS, SubPopulations and InteRmediate Outcome Measures In COPD Study (SPIROMICS) and Multi-Ethnic Study of A… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 29 publications
(25 reference statements)
0
2
0
Order By: Relevance
“…Our framework unifies QTL privacy analyses, because it allows joint prediction of genotypes from any source. It would be interesting to explore incorporating other sources of phenotypic data that could leak genotype information into the DSM, including protein abundance (pQTLs) (Hill et al 2023), methylation (mQTLs) (Gaunt et al 2016;Backes et al 2017), and allele-specific expression (aseQTLs) (Gürsoy et al 2021). Another promising direction would be extending the DSM formulation to the coexpression of genes conditional upon eQTLs, which has been used by other models (Gamazon et al 2015;Gusev et al 2016).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Our framework unifies QTL privacy analyses, because it allows joint prediction of genotypes from any source. It would be interesting to explore incorporating other sources of phenotypic data that could leak genotype information into the DSM, including protein abundance (pQTLs) (Hill et al 2023), methylation (mQTLs) (Gaunt et al 2016;Backes et al 2017), and allele-specific expression (aseQTLs) (Gürsoy et al 2021). Another promising direction would be extending the DSM formulation to the coexpression of genes conditional upon eQTLs, which has been used by other models (Gamazon et al 2015;Gusev et al 2016).…”
Section: Discussionmentioning
confidence: 99%
“…The growing availability of large-scale genomic data repositories has led to increasing concerns for the privacy of the individuals from whom the data were collected ( Erlich and Narayanan 2014 ; Naveed et al 2015 ; Bonomi et al 2020 ). Although many nations and organizations have introduced policies and regulations (e.g., HIPAA and GDPR) to safeguard the collection, use, and sharing of personally-identifying information, existing policies often fall short of providing clear guidance regarding many widely used types of biomedical data, including genetic sequences and functional genomic data, for which the underlying privacy risks are often unclear and only beginning to be understood ( Clayton et al 2019 ; Gürsoy et al 2020 , 2021 , 2022b ; Wan et al 2022 ; Hill et al 2023 ). This lack of guidance, particularly for functional genomic data, presents a key challenge for ensuring the protection of study participants, leaving the possibility for future privacy breaches that may diminish public trust in the scientific community.…”
mentioning
confidence: 99%