The genome-wide association study (GWAS) approach has discovered hundreds of genetic variants associated with diseases and quantitative traits. However, despite clinical overlap and statistical correlation between many phenotypes, GWAS are generally performed one-phenotype-at-a-time. Here we compare the performance of modelling multiple phenotypes jointly with that of the standard univariate approach. We introduce a new method and software, MultiPhen, that models multiple phenotypes simultaneously in a fast and interpretable way. By performing ordinal regression, MultiPhen tests the linear combination of phenotypes most associated with the genotypes at each SNP, and thus potentially captures effects hidden to single phenotype GWAS. We demonstrate via simulation that this approach provides a dramatic increase in power in many scenarios. There is a boost in power for variants that affect multiple phenotypes and for those that affect only one phenotype. While other multivariate methods have similar power gains, we describe several benefits of MultiPhen over these. In particular, we demonstrate that other multivariate methods that assume the genotypes are normally distributed, such as canonical correlation analysis (CCA) and MANOVA, can have highly inflated type-1 error rates when testing case-control or non-normal continuous phenotypes, while MultiPhen produces no such inflation. To test the performance of MultiPhen on real data we applied it to lipid traits in the Northern Finland Birth Cohort 1966 (NFBC1966). In these data MultiPhen discovers 21% more independent SNPs with known associations than the standard univariate GWAS approach, while applying MultiPhen in addition to the standard approach provides 37% increased discovery. The most associated linear combinations of the lipids estimated by MultiPhen at the leading SNPs accurately reflect the Friedewald Formula, suggesting that MultiPhen could be used to refine the definition of existing phenotypes or uncover novel heritable phenotypes.
SUMMARY Intrahepatic cholangiocarcinoma (ICC) and hepatocellular carcinoma (HCC) are clinically disparate primary liver cancers with etiological and biological heterogeneity. We identified common molecular subtypes linked to similar prognosis among 199 Thai ICC and HCC patients through systems integration of genomics, transcriptomics, and metabolomics. While ICC and HCC share recurrently mutated genes, including TP53, ARID1A, and ARID2, mitotic checkpoint anomalies distinguish the C1 subtype with key drivers PLK1 and ECT2, whereas the C2 subtype is linked to obesity, T-cell infiltration and bile acid metabolism. These molecular subtypes are found in 582 Asian, but less so in 265 Caucasian patients. Thus, Asian ICC and HCC, while clinically treated as separate entities, share common molecular subtypes with similar actionable drivers to improve precision therapy.
Our work provides unique perspectives into the biodiversity of CSC subpopulations, whose molecular heterogeneity further highlights their role in tumor heterogeneity, prognosis, and hepatic CSC therapy. (Hepatology 2018;68:127-140).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.