2018
DOI: 10.1101/438986
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Modular modeling improves the predictions of genetic variant effects on splicing

Abstract: Predicting the effects of genetic variants on splicing is highly relevant for human genetics. We describe the framework MMSplice (modular modeling of splicing) with which we built the winning model of the CAGI 2018 exon skipping prediction challenge. The MMSplice modules are neural networks scoring exon, intron, and splice sites, trained on distinct large-scale genomics datasets. These modules are combined to predict effects of variants on exon skipping, alternative donor and acceptor sites, splicing efficienc… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 47 publications
0
3
0
Order By: Relevance
“…This last point highlights the complexity of splicing that does not only depend on the 5'ss, 3'ss and the BP. To illustrate this complexity, a recent study was published [23] demonstrating the MMSplice tool which gathers several features from intronic and exonic pre-mRNA sequences. This tool was assayed on the Vex-seq data [24] To investigate potential spliceogenic variants occurring in the BP area, we gathered a large collection of 120 human variants (62 unpublished), with their corresponding in vitro RNA data.…”
Section: Discussionmentioning
confidence: 99%
“…This last point highlights the complexity of splicing that does not only depend on the 5'ss, 3'ss and the BP. To illustrate this complexity, a recent study was published [23] demonstrating the MMSplice tool which gathers several features from intronic and exonic pre-mRNA sequences. This tool was assayed on the Vex-seq data [24] To investigate potential spliceogenic variants occurring in the BP area, we gathered a large collection of 120 human variants (62 unpublished), with their corresponding in vitro RNA data.…”
Section: Discussionmentioning
confidence: 99%
“…Bioinformatic analysis of probable splicing alterations was performed with freely available tools: HumanSplicingFinder 3.1 (HSF3.1) ( ) (accessed on 12 June 2020) [ 20 ], SpliceAI [ 21 ], MMSplice [ 22 ], SpiP [ 23 ], Ex-Skip ( ) (accessed on 15 April 2021) [ 24 ], Hexplorer ( ) (accessed on 15 April 2021) [ 25 ] and HExoSplice ( ) (accessed on 15 April 2021) [ 26 ].…”
Section: Methodsmentioning
confidence: 99%
“…Agarwal et al predicted gene activity from 10 kbp DNA sequences surrounding the TSS(Agarwal and Shendure (2020)). The authors could not find motifs used by the network but an analysis of the over-represented k-mers in the promoters of highly active genes (according to the network) reveals the importance of CpG islands in predicting gene activity.Splicing, translation and polyadenylation of RNACheng et al developed a neural network to predict gene splice sites from the RNA sequence(Cheng et al (2019(Cheng et al ( , 2021). Analysis of the effect of variants using this network shows its utility in understanding the genomic causes of autism.…”
mentioning
confidence: 99%