Background:
Identifying genetic variation associated with plasma protein levels, and the mechanisms by which they act, could provide insight into alterable processes involved in regulation of protein levels. While protein levels can be affected by genetic variants, their estimation can also be biased by missense variants in coding exons causing technical artifacts. Integrating genome sequence genotype data with mass spectrometry-based protein level estimation could reduce bias, thereby improving detection of variation that affects RNA or protein metabolism.
Methods:
Here, we integrate the blood plasma protein levels of 664 proteins from 165 participants of the Tromsø Study, measured via TMT-mass spectrometry, with whole exome sequencing data to identify common and rare genetic variation associated with peptide and protein levels (pQTLs). We additionally use literature and database searches to prioritize putative functional variants for each pQTL.
Results:
We identify 109 independent associations (36 protein and 73 peptide), and use genotype data to exclude 49 (4 protein and 45 peptide) as technical artifacts. We describe two particular cases of rare variation: one associated with the complement pathway, and one with platelet degranulation. We identify putative functional variants and show that pQTLs act through diverse molecular mechanisms that affect both RNA and protein metabolism.
Conclusions:
We show that, while the majority of pQTLs exert their effects by modulating RNA metabolism, many affect protein levels directly. Our work demonstrates the extent by which pQTL studies are affected by technical artifacts, and highlights how prioritizing the functional variant in pQTL studies can lead to insights into the molecular steps by which a protein may be regulated.