Background Genomic variants which disrupt splicing are a major cause of rare genetic diseases. However, variants which lie outside of the canonical splice sites are difficult to interpret clinically. Improving the clinical interpretation of non-canonical splicing variants offers a major opportunity to uplift diagnostic yields from whole genome sequencing data. Methods Here, we examine the landscape of splicing variants in whole-genome sequencing data from 38,688 individuals in the 100,000 Genomes Project and assess the contribution of non-canonical splicing variants to rare genetic diseases. We use a variant-level constraint metric (the mutability-adjusted proportion of singletons) to identify constrained functional variant classes near exon–intron junctions and at putative splicing branchpoints. To identify new diagnoses for individuals with unsolved rare diseases in the 100,000 Genomes Project, we identified individuals with de novo single-nucleotide variants near exon–intron boundaries and at putative splicing branchpoints in known disease genes. We identified candidate diagnostic variants through manual phenotype matching and confirmed new molecular diagnoses through clinical variant interpretation and functional RNA studies. Results We show that near-splice positions and splicing branchpoints are highly constrained by purifying selection and harbour potentially damaging non-coding variants which are amenable to systematic analysis in sequencing data. From 258 de novo splicing variants in known rare disease genes, we identify 35 new likely diagnoses in probands with an unsolved rare disease. To date, we have confirmed a new diagnosis for six individuals, including four in whom RNA studies were performed. Conclusions Overall, we demonstrate the clinical value of examining non-canonical splicing variants in individuals with unsolved rare diseases.
Genomic variants which disrupt splicing are a major cause of rare genetic disease. However, variants which lie outside of the canonical splice sites are difficult to interpret clinically. Here, we examine the landscape of splicing variants in whole-genome sequencing data from 38,688 individuals in the 100,000 Genomes Project, and assess the contribution of non-canonical splicing variants to rare genetic diseases. We show that splicing branchpoints are highly constrained by purifying selection, and harbour damaging non-coding variants which are amenable to systematic analysis in sequencing data. From 258 de novo splicing variants in known rare disease genes, we identify 35 new likely diagnoses in probands with an unsolved rare disease. We use phenotype matching and RNA studies to confirm a new diagnosis for six individuals to date. In summary, we demonstrate the clinical value of examining non-canonical splicing variants in participants with unsolved rare diseases.
Use of blood RNA sequencing (RNA-seq) as a splicing analysis tool for clinical interpretation of variants of uncertain significance (VUSs) found via whole-genome and exome sequencing can be difficult for genes that have low expression in the blood due to insufficient read count coverage aligned to specific genes of interest.Here, we present a short amplicon reverse transcription-polymerase chain reaction (RT-PCR) for the detection of genes with low blood expression. Short amplicon RT-PCR, is designed to span three exons where an exon harboring a variant is flanked by one upstream and one downstream exon. We tested short amplicon RT-PCRs for genes that have median transcripts per million (TPM) values less than one according to the genotype-tissue expression database. Median TPM values of genes analyzed in this study are SYN1 = 0.8549, COL1A1 = 0.6275, TCF4 = 0.4009, DSP = .2894, TTN = 0.2851, COL5A2 = 0.1036, TERT = 0.04452, NTRK2 = 0.0344, ABCA4 = 0.00744, PRPH = 0, and WT1 = 0. All these genes show insufficient exon-spanning read coverage in our RNA-seq data to allow splicing analysis. We successfully detected all genes tested except PRPH and WT1. Aberrant splicing was detected in SYN1, TCF4, NTRK2, TTN, and TERT VUSs. Therefore, our results show short amplicon RT-PCR is a useful alternative for the analysis of splicing events in genes with low TPM in blood RNA for clinical diagnostics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.