The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale.
Autosomal-dominant polycystic kidney disease (ADPKD) is a common, progressive, adult-onset disease that is an important cause of end-stage renal disease (ESRD), which requires transplantation or dialysis. Mutations in PKD1 or PKD2 (∼85% and ∼15% of resolved cases, respectively) are the known causes of ADPKD. Extrarenal manifestations include an increased level of intracranial aneurysms and polycystic liver disease (PLD), which can be severe and associated with significant morbidity. Autosomal-dominant PLD (ADPLD) with no or very few renal cysts is a separate disorder caused by PRKCSH, SEC63, or LRP5 mutations. After screening, 7%-10% of ADPKD-affected and ∼50% of ADPLD-affected families were genetically unresolved (GUR), suggesting further genetic heterogeneity of both disorders. Whole-exome sequencing of six GUR ADPKD-affected families identified one with a missense mutation in GANAB, encoding glucosidase II subunit α (GIIα). Because PRKCSH encodes GIIβ, GANAB is a strong ADPKD and ADPLD candidate gene. Sanger screening of 321 additional GUR families identified eight further likely mutations (six truncating), and a total of 20 affected individuals were identified in seven ADPKD- and two ADPLD-affected families. The phenotype was mild PKD and variable, including severe, PLD. Analysis of GANAB-null cells showed an absolute requirement of GIIα for maturation and surface and ciliary localization of the ADPKD proteins (PC1 and PC2), and reduced mature PC1 was seen in GANAB(+/-) cells. PC1 surface localization in GANAB(-/-) cells was rescued by wild-type, but not mutant, GIIα. Overall, we show that GANAB mutations cause ADPKD and ADPLD and that the cystogenesis is most likely driven by defects in PC1 maturation.
BackgroundAlthough the costs of next generation sequencing technology have decreased over the past years, there is still a lack of simple-to-use applications, for a comprehensive analysis of RNA sequencing data. There is no one-stop shop for transcriptomic genomics. We have developed MAP-RSeq, a comprehensive computational workflow that can be used for obtaining genomic features from transcriptomic sequencing data, for any genome.ResultsFor optimization of tools and parameters, MAP-RSeq was validated using both simulated and real datasets. MAP-RSeq workflow consists of six major modules such as alignment of reads, quality assessment of reads, gene expression assessment and exon read counting, identification of expressed single nucleotide variants (SNVs), detection of fusion transcripts, summarization of transcriptomics data and final report. This workflow is available for Human transcriptome analysis and can be easily adapted and used for other genomes. Several clinical and research projects at the Mayo Clinic have applied the MAP-RSeq workflow for RNA-Seq studies. The results from MAP-RSeq have thus far enabled clinicians and researchers to understand the transcriptomic landscape of diseases for better diagnosis and treatment of patients.ConclusionsOur software provides gene counts, exon counts, fusion candidates, expressed single nucleotide variants, mapping statistics, visualizations, and a detailed research data report for RNA-Seq. The workflow can be executed on a standalone virtual machine or on a parallel Sun Grid Engine cluster. The software can be downloaded from http://bioinformaticstools.mayo.edu/research/maprseq/.
Autosomal-dominant polycystic kidney disease (ADPKD) is characterized by the progressive development of kidney cysts, often resulting in end-stage renal disease (ESRD). This disorder is genetically heterogeneous with ∼7% of families genetically unresolved. We performed whole-exome sequencing (WES) in two multiplex ADPKD-like pedigrees, and we analyzed a further 591 genetically unresolved, phenotypically similar families by targeted next-generation sequencing of 65 candidate genes. WES identified a DNAJB11 missense variant (p.Pro54Arg) in two family members presenting with non-enlarged polycystic kidneys and a frameshifting change (c.166_167insTT) in a second family with small renal and liver cysts. DNAJB11 is a co-factor of BiP, a key chaperone in the endoplasmic reticulum controlling folding, trafficking, and degradation of secreted and membrane proteins. Five additional multigenerational families carrying DNAJB11 mutations were identified by the targeted analysis. The clinical phenotype was consistent in the 23 affected members, with non-enlarged cystic kidneys that often evolved to kidney atrophy; 7 subjects reached ESRD from 59 to 89 years. The lack of kidney enlargement, histologically evident interstitial fibrosis in non-cystic parenchyma, and recurring episodes of gout (one family) suggested partial phenotypic overlap with autosomal-dominant tubulointerstitial diseases (ADTKD). Characterization of DNAJB11-null cells and kidney samples from affected individuals revealed a pathogenesis associated with maturation and trafficking defects involving the ADPKD protein, PC1, and ADTKD proteins, such as UMOD. DNAJB11-associated disease is a phenotypic hybrid of ADPKD and ADTKD, characterized by normal-sized cystic kidneys and progressive interstitial fibrosis resulting in late-onset ESRD.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.