2019
DOI: 10.1186/s12859-019-3306-3
|View full text |Cite
|
Sign up to set email alerts
|

SpliceFinder: ab initio prediction of splice sites using convolutional neural network

Abstract: Background: Identifying splice sites is a necessary step to analyze the location and structure of genes. Two dinucleotides, GT and AG, are highly frequent on splice sites, and many other patterns are also on splice sites with important biological functions. Meanwhile, the dinucleotides occur frequently at the sequences without splice sites, which makes the prediction prone to generate false positives. Most existing tools select all the sequences with the two dimers and then focus on distinguishing the true spl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
66
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(66 citation statements)
references
References 35 publications
0
66
0
Order By: Relevance
“…Genetic variants arising due to RNA splicing are more frequently found in individuals having neurodevelopmental disorders, with variable mutation rates between the pre-messenger RNA (pre-mRNA) and mature RNA processing stages ( 3 , 4 ). These events in genetic mutation reflect marked differences in the balance of transcriptional regulation fidelity among neural networking models ( 5 ). A correlation between splicing-mediated mutation and the likelihood of response to neurodegenerative disorders (NDs), together with the identification of associations between gene mutations and clinical outcomes of NDs, can provide comprehensive information on AS for diagnosis.…”
mentioning
confidence: 99%
“…Genetic variants arising due to RNA splicing are more frequently found in individuals having neurodevelopmental disorders, with variable mutation rates between the pre-messenger RNA (pre-mRNA) and mature RNA processing stages ( 3 , 4 ). These events in genetic mutation reflect marked differences in the balance of transcriptional regulation fidelity among neural networking models ( 5 ). A correlation between splicing-mediated mutation and the likelihood of response to neurodegenerative disorders (NDs), together with the identification of associations between gene mutations and clinical outcomes of NDs, can provide comprehensive information on AS for diagnosis.…”
mentioning
confidence: 99%
“…The challenges involved in performing manual feature extraction and model training led to development of models using Artificial Neural Network (ANN) [31,32] that performed automated feature representation. Many DL architectures were used and developed for splice site prediction based on CNN [33,34,35,36,37], RNN [13,38], Restricted Boltzmann Machines (RBM) [39], Autoencoders [40,41] and Deep Belief Networks [39]. Although these DL architectures have removed the burden of manual feature extraction, they are still time consuming to train and a much deeper knowledge on SS associated functions and evolution has been strongly urged.…”
Section: Splice Site Recognition Problemmentioning
confidence: 99%
“…Koo et al [2018] used a similar procedure to understand how motif number and spacing impacts prediction. Meanwhile, a back-propagation method called DeepLIFT [Shrikumar et al, 2019] identifies motifs by comparing the gradient for a sample-of-interest against a reference [Wang et al, 2019]. In contrast, our model takes inspiration from self-explanation, in which interpretability is built-in architecturally [Alvarez-Melis and Jaakkola, 2018].…”
Section: Related Workmentioning
confidence: 99%
“…Interest has gathered recently around how machine learning, especially deep convolutional neural networks (CNNs), could be used to predict genomic events like RNA splicing [Ching et al, 2018, Jaganathan et al, 2019, Wang et al, 2019, Albaradei et al, 2020. Although deep CNNs achieve good performance, they do not learn an explicit model of motif pairs or their distancedependent interactions.…”
Section: Introductionmentioning
confidence: 99%