The emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association studies (PheWAS). In PheWAS, the whole phenome can be divided into numerous phenotypic categories according to the genetic architecture across phenotypes. Currently, statistical analyses for PheWAS are mainly univariate analyses, which test the association between one genetic variant and one phenotype at a time. In this article, we derived a novel and powerful multivariate method for PheWAS. The proposed method involves three steps. In the first step, we apply the bottom-up hierarchical clustering method to partition a large number of phenotypes into disjoint clusters within each phenotypic category. In the second step, the clustering linear combination method is used to combine test statistics within each category based on the phenotypic clusters and obtain p-values from each phenotypic category. In the third step, we propose a new false discovery rate (FDR) control approach. We perform extensive simulation studies to compare the performance of our method with that of other existing methods. The results show that our proposed method controls FDR very well and outperforms other methods we compared with. We also apply the proposed approach to a set of EMR-based phenotypes across more than 300,000 samples from the UK Biobank. We find that the proposed approach not only can well-control FDR at a nominal level but also successfully identify 1,244 significant SNPs that are reported to be associated with some phenotypes in the GWAS catalog. Our open-access tools and instructions on how to implement HCLC-FC are available at https://github.com/XiaoyuLiang/HCLCFC.
Joint analysis of multiple correlated phenotypes for genome-wide association studies (GWAS) can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits. Meanwhile, constructing a network based on associations between phenotypes and genotypes provides a new insight to analyze multiple phenotypes, which can explore whether phenotypes and genotypes might be related to each other at a higher level of cellular and organismal organization. In this paper, we first develop a bipartite signed network by linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN). The GPN can be constructed by a mixture of quantitative and qualitative phenotypes and is applicable to binary phenotypes with extremely unbalanced case-control ratios in large-scale biobank datasets. We then apply a powerful community detection method to partition phenotypes into disjoint network modules based on GPN. Finally, we jointly test the association between multiple phenotypes in a network module and a single nucleotide polymorphism (SNP). Simulations and analyses of 72 complex traits in the UK Biobank show that multiple phenotype association tests based on network modules detected by GPN are much more powerful than those without considering network modules. The newly proposed GPN provides a new insight to investigate the genetic architecture among different types of phenotypes. Multiple phenotypes association studies based on GPN are improved by incorporating the genetic information into the phenotype clustering. Notably, it might broaden the understanding of genetic architecture that exists between diagnoses, genes, and pleiotropy.
The emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association studies (PheWAS). In PheWAS, the whole phenome can be divided into numerous phenotypic categories according to the genetic architecture across phenotypes. Currently, statistical analyses for PheWAS are mainly univariate analyses, which test the association between one genetic variant and one phenotype at a time. In this article, we derived a novel and powerful multivariate method for PheWAS. The proposed method involves three steps. In the first step, we apply the bottom-up hierarchical clustering method to partition a large number of phenotypes into disjoint clusters within each phenotypic category. In the second step, the clustering linear combination method is used to combine test statistics within each category based on the phenotypic clusters and obtain p-values from each phenotypic category. In the third step, we propose a new false discovery rate (FDR) control approach. We perform extensive simulation studies to compare the performance of our method with that of other existing methods. The results show that our proposed method controls FDR very well and outperforms other methods we compared with. We also apply the proposed approach to a set of EMR-based phenotypes across more than 300,000 samples from UK Biobank. We find that the proposed approach not only can well-control FDR at a nominal level but also successfully identify 1,244 significant SNPs that are reported to be associated with some phenotypes in the GWAS catalog. Our open-access tools and instructions on how to implement HCLC-FC are available at https://github.com/XiaoyuLiang/HCLCFC .
Angiotensin converting enzyme-2 (ACE2) and associated proteins play a pivotal role in various physiological and pathological events, such as immune activation, inflammation, gut barrier maintenance, intestinal stem cell proliferation, and apoptosis. Although many of these clinical events are quite significant in SIV/HIV infection, expression profiling of these proteins has not been well reported. Considering the different pathological consequences in the gut after HIV infection, we hypothesized that the expression of ACE2 and associated proteins of the Renin-angiotensin system (RAS) could be compromised after SIV/HIV infection. We quantified the gene expression of ACE2 as well as AGTR1/2, ADAM17, and TMPRSS2, and compared between SIV infected and uninfected rhesus macaques (Macaca mulatta; hereafter abbreviated RMs). The gene expression analysis revealed significant downregulation of ACE2 and upregulation of AGTR2 and inflammatory cytokine IL-6 in the gut of infected RMs. Protein expression profiling also revealed significant upregulation of AGTR2 after infection. The expression of ACE2 in protein level was also decreased, but not significantly, after infection. To understand the entirety of the process in newly regenerated epithelial cells, a global transcriptomic study of enteroids raised from intestinal stem cells was performed. Interestingly, most of the genes associated with the RAS, such as DPP4, MME, ANPEP, ACE2, ENPEP, were found to be downregulated in SIV infection. HNFA1 was found to be a key regulator of ACE2 and related protein expression. Jejunum CD4+ T cell depletion and increased IL-6 mRNA, MCP-1 and AGTR2 expression may signal inflammation, monocyte/macrophage accumulation and epithelial apoptosis in accelerating SIV pathogenesis. Overall, the findings in the study suggested a possible impact of SIV/HIV infection on expression of ACE2 and RAS-associated proteins resulting in the loss of gut homeostasis. In the context of the current COVID-19 pandemic, the outcome of SARS-CoV-2 and HIV co-infection remains uncertain and needs further investigation as the significance profile of ACE2, a viral entry receptor for SARS-CoV-2, and its expression in mRNA and protein varied in the current study. There is a concern of aggravated SARS-CoV-2 outcomes due to possible serious pathological events in the gut resulting from compromised expression of RAS- associated proteins in SIV/HIV infection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.