Age is the dominant risk factor for most chronic human diseases; yet the mechanisms by which aging confers this risk are largely unknown. 1 Recently, the age-related acquisition of somatic mutations in regenerating hematopoietic stem cell populations leading to clonal expansion was associated with both hematologic cancer 2 – 4 and coronary heart disease 5 , a phenomenon termed ‘Clonal Hematopoiesis of Indeterminate Potential’ (CHIP). 6 Simultaneous germline and somatic whole genome sequence analysis now provides the opportunity to identify root causes of CHIP. Here, we analyze high-coverage whole genome sequences from 97,691 participants of diverse ancestries in the NHLBI TOPMed program and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid, and inflammatory traits specific to different CHIP genes. Association of a genome-wide set of germline genetic variants identified three genetic loci associated with CHIP status, including one locus at TET2 that was African ancestry specific. In silico -informed in vitro evaluation of the TET2 germline locus identified a causal variant that disrupts a TET2 distal enhancer resulting in increased hematopoietic stem cell self-renewal. Overall, we observe that germline genetic variation shapes hematopoietic stem cell function leading to CHIP through mechanisms that are both specific to clonal hematopoiesis and shared mechanisms leading to somatic mutations across tissues.
Large-scale whole genome sequencing (WGS) studies have enabled the analysis of rare variants (RVs) associated with complex phenotypes. Commonly used RV association tests (RVATs) have limited scope to leverage variant functions. We propose STAAR (variant-Set Test for Association using Annotation infoRmation), a scalable and powerful RVAT method that effectively incorporates both variant categories and multiple complementary annotations using a dynamic weighting scheme. For the latter, we introduce “annotation Principal Components”, multi-dimensional summaries of in silico variant annotations. STAAR accounts for population structure and relatedness, and is scalable for analyzing very large cohort and biobank WGS studies of continuous and dichotomous traits. We applied STAAR to identify RVs associated with four lipid traits in 12,316 discovery samples and 17,822 replication samples from the Trans-Omics for Precision Medicine program. We discovered and replicated novel RV associations, including disruptive missense RVs of NPC1L1 and an intergenic region near APOC1P1 associated with low-density lipoprotein cholesterol.
Y, et al. (2019) Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations. PLoS Genet 15(12): e1008500.
Context Although genetic influences on bipolar disorder are well established, localization of genes that predispose to the illness has proven difficult. Given that genes predisposing to bipolar disorder may be transmitted without expression of the categorical clinical phenotype, one strategy for identifying risk genes is the use of quantitative endophenotypes. Objective The goal of the current study is to adjudicate neurocognitive endophenotypes for bipolar disorder. Design, Setting, and Participants 709 Latino individuals from the central valley of Costa Rica, Mexico City, Mexico, or San Antonio, Texas participated in the study. 660 of these persons were members of extended pedigrees with at least two siblings diagnosed with bipolar disorder (n=230). The remaining subjects were community controls drawn from each site and without personal or family history of bipolar disorder or schizophrenia. All subjects received psychodiagnostic interviews and comprehensive neurocognitive evaluations. Neurocognitive measures found to be heritable were entered into analyses designed to determine which tests are impaired in affected individuals, sensitive to genetic liability for the illness and genetically correlated with affection status. Main Outcome Measures The main outcome measure was neurocognitive test performance. Results Two of the 21 neurocognitive variables were not significantly heritable and were excluded from subsequent analyses. Patients with bipolar disorder were impaired on 6 of these cognitive measures compared to non-related healthy subjects. Non-bipolar first-degree relatives were impaired on five of these and three tests were genetically correlated with affection status: digit symbol coding, object delayed response, and immediate facial memory. Conclusions This large-scale extended pedigree study of cognitive functioning in bipolar disorder identified measures of processing speed, working memory and declarative (facial) memory as candidate endophenotypes for bipolar disorder.
The data set simulated for Genetic Analysis Workshop 17 was designed to mimic a subset of data that might be produced in a full exome screen for a complex disorder and related risk factors in order to permit workshop participants to investigate issues of study design and statistical genetic analysis. Real sequence data from the 1000 Genomes Project formed the basis for simulating a common disease trait with a prevalence of 30% and three related quantitative risk factors in a sample of 697 unrelated individuals and a second sample of 697 individuals in large, extended pedigrees. Called genotypes for 24,487 autosomal markers assigned to 3,205 genes and simulated affection status, quantitative traits, age, sex, pedigree relationships, and cigarette smoking were provided to workshop participants. The simulating model included both common and rare variants with minor allele frequencies ranging from 0.07% to 25.8% and a wide range of effect sizes for these variants. Genotype-smoking interaction effects were included for variants in one gene. Functional variants were concentrated in genes selected from specific biological pathways and were selected on the basis of the predicted deleteriousness of the coding change. For each sample, unrelated individuals and family, 200 replicates of the phenotypes were simulated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.