2022
DOI: 10.1371/journal.pone.0269159
|View full text |Cite
|
Sign up to set email alerts
|

CI-SpliceAI—Improving machine learning predictions of disease causing splicing variants using curated alternative splice sites

Abstract: Background It is estimated that up to 50% of all disease causing variants disrupt splicing. Due to its complexity, our ability to predict which variants disrupt splicing is limited, meaning missed diagnoses for patients. The emergence of machine learning for targeted medicine holds great potential to improve prediction of splice disrupting variants. The recently published SpliceAI algorithm utilises deep neural networks and has been reported to have a greater accuracy than other commonly used methods. Method… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 36 publications
0
15
0
Order By: Relevance
“…However, most SpliceAI current public implementations or pre-computed whole genome VCFs currently do not process complex delins variations (i.e., other than deletion, insertion, or substitution), nor does Pangolin. Of note, those complex variations are handled by CI-SpliceAI but with numerical results [12]. The functional study of this variant by a minigene assay has shown the skipping of an entire out-of-frame exon [30].…”
Section: Interpreting Complex Delinsmentioning
confidence: 99%
See 2 more Smart Citations
“…However, most SpliceAI current public implementations or pre-computed whole genome VCFs currently do not process complex delins variations (i.e., other than deletion, insertion, or substitution), nor does Pangolin. Of note, those complex variations are handled by CI-SpliceAI but with numerical results [12]. The functional study of this variant by a minigene assay has shown the skipping of an entire out-of-frame exon [30].…”
Section: Interpreting Complex Delinsmentioning
confidence: 99%
“…This de Sainte Agathe et al Human Genomics (2023) 17:7 ability to focus not only on the nearby site (destruction or creation) but at the whole transcript level is a unique feature of these deep-learning-based next-generation splicing predictors, such as SpliceAI or Pangolin [11]. In a recent improvement, the SpliceAI neural network has been retrained with a curated and manually validated isoforms dataset [12]. Still, the standard version of SpliceAI (currently v1.3.1) has some limitations.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In every case, the organism-specific SMsplice model had the best performance, as expected. As a point of comparison, we also applied the neural net model CI-SpliceAI to the same test sets (Strauch et al, 2022). This black box model was trained on human sequences and so the most direct comparison is to the SMsplice model with fully human parameters.…”
Section: Smsplice Modelmentioning
confidence: 99%
“…For example, Pangolin [26] uses splicing quantifications from multiple species and tissues to not only predict whether a position is a splice site (as SpliceAI does) but also to predict splice site usage (e.g., how much a splice site is being used in a given tissue). In contrast, CI-SpliceAI [27] uses different training labels for true and false splice site positions based on a collapsed transcript structure derived from GENCODE [28] annotations.…”
Section: Introductionmentioning
confidence: 99%