Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates the “imputed” gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. The genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome datasets. PrediXcan enjoys the benefits of gene-based approaches such as reduced multiple testing burden and a principled approach to the design of follow-up experiments. Our results demonstrate that PrediXcan can detect known and novel genes associated with disease traits and provide insights into the mechanism of these associations.
Scalable, integrative methods to understand mechanisms that link genetic variants with phenotypes are needed. Here we derive a mathematical expression to compute PrediXcan (a gene mapping approach) results using summary data (S-PrediXcan) and show its accuracy and general robustness to misspecified reference sets. We apply this framework to 44 GTEx tissues and 100+ phenotypes from GWAS and meta-analysis studies, creating a growing public catalog of associations that seeks to capture the effects of gene expression variation on human phenotypes. Replication in an independent cohort is shown. Most of the associations are tissue specific, suggesting context specificity of the trait etiology. Colocalized significant associations in unexpected tissues underscore the need for an agnostic scanning of multiple contexts to improve our ability to detect causal regulatory mechanisms. Monogenic disease genes are enriched among significant associations for related traits, suggesting that smaller alterations of these genes may cause a spectrum of milder phenotypes.
14To understand the biological mechanisms underlying the thousands of genetic variants robustly associated with 15 complex traits, scalable methods that integrate GWAS and functional data generated by large-scale efforts are 16 needed. We derived a mathematical expression to compute PrediXcan results using summary data (S-17 PrediXcan) and showed its accuracy and robustness to misspecified reference populations. We compared S-18PrediXcan with existing methods and combined them into a best practice framework (MetaXcan) that 19integrates GWAS with QTL studies and reduces LD-confounded associations. We applied this framework to 44 20 GTEx tissues and 101 phenotypes from GWAS and meta-analysis studies, creating a growing catalog of 21 associations that captures the effects of gene expression variation on human phenotypes. Most of the 22 associations were tissue specific, indicating context specificity of the trait etiology. Colocalized significant 23 associations in unexpected tissues underscore the advantages of an agnostic scanning of multiple contexts to 24 increase the probability of detecting causal regulatory mechanisms. 25Prediction models, efficient software implementation, and association results are shared as a resource for 26 the research community.
We developed and clinically validated a hybrid capture next generation sequencing assay to detect somatic alterations and microsatellite instability in solid tumors and hematologic malignancies. This targeted oncology assay utilizes tumor-normal matched samples for highly accurate somatic alteration calling and whole transcriptome RNA sequencing for unbiased identification of gene fusion events. The assay was validated with a combination of clinical specimens and cell lines, and recorded a sensitivity of 99.1% for single nucleotide variants, 98.1% for indels, 99.9% for gene rearrangements, 98.4% for copy number variations, and 99.9% for microsatellite instability detection. This assay presents a wide array of data for clinical management and clinical trial enrollment while conserving limited tissue.
Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR < 0.1) in the DGN whole blood cohort. However, current sample sizes (n ≤ 922) do not allow us to compute distal h2. Bayesian Sparse Linear Mixed Model (BSLMM) analysis provides strong evidence that the genetic contribution to local expression traits is dominated by a handful of genetic variants rather than by the collective contribution of a large number of variants each of modest size. In other words, the local architecture of gene expression traits is sparse rather than polygenic across all 40 tissues (from DGN and GTEx) examined. This result is confirmed by the sparsity of optimal performing gene expression predictors via elastic net modeling. To further explore the tissue context specificity, we decompose the expression traits into cross-tissue and tissue-specific components using a novel Orthogonal Tissue Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.