Summary paragraphThe Trans-Omics for Precision Medicine (TOPMed) program seeks to elucidate the genetic architecture and disease biology of heart, lung, blood, and sleep disorders, with the ultimate goal of improving diagnosis, treatment, and prevention. The initial phases of the program focus on whole genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here, we describe TOPMed goals and design as well as resources and early insights from the sequence data. The resources include a variant browser, a genotype imputation panel, and sharing of genomic and phenotypic data via dbGaP. In 53,581 TOPMed samples, >400 million single-nucleotide and insertion/deletion variants were detected by alignment with the reference genome. Additional novel variants are detectable through assembly of unmapped reads and customized analysis in highly variable loci. Among the >400 million variants detected, 97% have frequency <1% and 46% are singletons. These rare variants provide insights into mutational processes and recent human evolutionary history. The nearly complete catalog of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and non-coding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and extends the reach of nearly all genome-wide association studies to include variants down to ~0.01% in frequency.
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
PurposeGenotyping CYP2D6 is important for precision drug therapy because it metabolizes approximately 25% of drugs and its activity varies considerably among individuals. Genotype analysis of CYP2D6 is challenging due to its highly polymorphic nature. Over 100 haplotypes (star alleles) have been defined for CYP2D6, some involving a gene conversion with its nearby non-functional but highly homologous paralog CYP2D7. We present Stargazer, a new bioinformatics tool that uses next-generation sequencing (NGS) data to call star alleles for CYP2D6 (https://stargazer.gs.washington.edu/stargazerweb/). Stargazer is currently being extended for other pharmacogenes.MethodsStargazer identifies star alleles from NGS data by detecting single nucleotide variants, insertion-deletion variants, and structural variants. Stargazer detects structural variation including gene deletions, duplications, and conversions by calculating paralog-specific copy number from read depth.ResultsWe applied Stargazer to NGS data of 32 ethnically diverse HapMap trios that were genotyped by TaqMan assays, long-range PCR, quantitative multiplex PCR, High Resolution Melt analysis, and/or Sanger sequencing. CYP2D6 genotyping by Stargazer was 99.0% concordant with data obtained by these methods and showed 28.1% of the samples had structural variation including CYP2D6/CYP2D7 hybrids.ConclusionAccurate genotyping of pharmacogenes with NGS and subsequent allele calling with Stargazer will aid the implementation of precision drug therapy.
Variation in the enzymatic activity of pharmacogenes is defined by star alleles (haplotypes) comprised of single‐nucleotide variants, small insertion‐deletions, and large structural variants. We recently developed Stargazer, a next‐generation sequencing‐based tool to call star alleles for the clinically important CYP2D6 gene. Here, we present the utility of extending Stargazer to call star alleles for 28 pharmacogenes using whole genome sequencing (WGS) data. We applied Stargazer to WGS data from 70 ethnically diverse samples from the Genetic Testing Reference Materials Coordination Program (GeT‐RM). These reference samples were extensively characterized by GeT‐RM using multiple pharmacogenetic testing assays. In all 28 genes, Stargazer recalled 100% of star alleles (N = 92) present in GeT‐RM's consensus genotypes (N = 1,559). Stargazer also detected star alleles not previously reported by GeT‐RM, including complex structural variants. Our results demonstrate that combining WGS data and Stargazer enables automated, accurate, and comprehensive genotyping of pharmacogenes in the human genome.
The major objective of this study was to investigate the association of genetic and nongenetic factors with variability in protein abundance and in vitro activity of the androgen-metabolizing enzyme UGT2B17 in human liver microsomes ( = 455). UGT2B17 abundance was quantified by liquid chromatography-tandem mass spectrometry proteomics, and enzyme activity was determined by using testosterone and dihydrotestosterone as in vitro probe substrates. Genotyping or gene resequencing and mRNA expression were also evaluated. Multivariate analysis was used to test the association of UGT2B17 copy number variation, single nucleotide polymorphisms (SNPs), age, and sex with its mRNA expression, abundance, and activity. UGT2B17 gene copy number and SNPs (rs7436962, rs9996186, rs28374627, and rs4860305) were associated with gene expression, protein levels, and androgen glucuronidation rates in a gene dose-dependent manner. UGT2B17 protein (mean ± S.D. picomoles per milligram of microsomal protein) is sparsely expressed in children younger than 9 years (0.12 ± 0.24 years) but profoundly increases from age 9 years to adults (∼10-fold) with ∼2.6-fold greater abundance in males than in females (1.2 vs. 0.47). Association of androgen glucuronidation with UGT2B15 abundance was observed only in the low UGT2B17 expressers. These data can be used to predict variability in the metabolism of UGT2B17 substrates. Drug companies should include UGT2B17 in early phenotyping assays during drug discovery to avoid late clinical failures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.