2021
DOI: 10.1038/s41598-021-99747-2
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of in silico strategies to prioritize rare genomic variants impacting RNA splicing for the diagnosis of genomic disorders

Abstract: The development of computational methods to assess pathogenicity of pre-messenger RNA splicing variants is critical for diagnosis of human disease. We assessed the capability of eight algorithms, and a consensus approach, to prioritize 249 variants of uncertain significance (VUSs) that underwent splicing functional analyses. The capability of algorithms to differentiate VUSs away from the immediate splice site as being ‘pathogenic’ or ‘benign’ is likely to have substantial impact on diagnostic testing. We show… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
24
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

3
5

Authors

Journals

citations
Cited by 44 publications
(26 citation statements)
references
References 52 publications
2
24
0
Order By: Relevance
“…Pathogenic variants, both protein-coding and intronic, that lie outside canonical splice sites may nonetheless act to disrupt pre-mRNA splicing through a diverse series of mechanisms ( Figure S1 ). 1 , 2 , 3 Effective identification of pathogenic splice-impacting variants remains challenging and is limited by the omission of intronic regions in targeted sequencing approaches, 4 , 5 discordance between in silico variant prioritization tools, 6 and the lack of availability of the appropriate tissue from which to survey RNA for splicing disruption. 7 , 8 …”
Section: Introductionmentioning
confidence: 99%
“…Pathogenic variants, both protein-coding and intronic, that lie outside canonical splice sites may nonetheless act to disrupt pre-mRNA splicing through a diverse series of mechanisms ( Figure S1 ). 1 , 2 , 3 Effective identification of pathogenic splice-impacting variants remains challenging and is limited by the omission of intronic regions in targeted sequencing approaches, 4 , 5 discordance between in silico variant prioritization tools, 6 and the lack of availability of the appropriate tissue from which to survey RNA for splicing disruption. 7 , 8 …”
Section: Introductionmentioning
confidence: 99%
“…On this data, CI-SpliceAI had the greatest accuracy of all tools tested on both the binary task and when predicting the exact variant effect. SpliceAI has performed favourably in many comparisons since its release in 2019 [ 16 , 17 , 20 , 24 , 29 ], and it was suggested that the simultaneous prediction of thousands of nucleotides around a variant is the key advantage for its success: the big window sizes might allow SpliceAI to recognise pairs of acceptors and donors and other co-dependent features not only near splice sites, but also deep within the exon or intron. Feature maps within a convolutional neural network that recognise patterns in data are applied as a sliding window, allowing splicing factors such as binding sites or the branch point to be recognised independent of their offset to a splice site, and variants within their motifs are considered in the classification.…”
Section: Discussionmentioning
confidence: 99%
“…Many splice prediction tools exist, but there is little consensus on which the best tools are, what thresholds should be used to detect significant disruptions, and how these applications should be applied in clinical diagnostics. Recent applications of machine learning to splice site prediction show great promise, and have been found to be more accurate than older but still widely used methods such as MaxEntScan (MES) [ 15 17 ]. MES models the likelihood of a splice site given 9 or 23 bases using a maximum entropy model; MMSplice [ 18 ] uses neural networks to predict splice sites given 18 or 53 nucleotides; SQUIRLS [ 19 ] uses carefully engineered features from around the splice sites to classify using decision trees; and SpliceAI [ 20 ] uses five deep convolutional neural networks to predict splice sites based on 10,000 nucleotides of context.…”
Section: Introductionmentioning
confidence: 99%
“…Although SpliceAI is regarded as the best single method to predict aberrant splicing, the combined use of different tools can further increase accuracy. 27 , 28 It is hypothesized that the c.383-1368A>G variant creates a novel binding site for the splicing enhancer, protein SRSF1 (SF2/ASF), leading to the inclusion of a pseudo-exon containing a premature stop codon. Although a minigene splicing assay with sequence analysis of the mutated transcript exactly matched aberrant splicing as predicted by combining deep learning splice prediction tools, 22 , 23 additional families would further support and validate our results.…”
Section: Discussionmentioning
confidence: 99%