2018
DOI: 10.1093/nar/gky567
|View full text |Cite
|
Sign up to set email alerts
|

A deep recurrent neural network discovers complex biological rules to decipher RNA protein-coding potential

Abstract: The current deluge of newly identified RNA transcripts presents a singular opportunity for improved assessment of coding potential, a cornerstone of genome annotation, and for machine-driven discovery of biological knowledge. While traditional, feature-based methods for RNA classification are limited by current scientific knowledge, deep learning methods can independently discover complex biological rules in the data de novo. We trained a gated recurrent neural network (RNN) on human messenger RNA (mRNA) and l… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
74
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 79 publications
(74 citation statements)
references
References 22 publications
0
74
0
Order By: Relevance
“…al. approach and mapped an input sequence of nucleotides to a sequence of vectors using an embedding layer (Hill et al, 2018). This sequence of vectors was then fed into a bidirectional recurrent neural network ( Fig.…”
Section: Classifiersmentioning
confidence: 99%
See 2 more Smart Citations
“…al. approach and mapped an input sequence of nucleotides to a sequence of vectors using an embedding layer (Hill et al, 2018). This sequence of vectors was then fed into a bidirectional recurrent neural network ( Fig.…”
Section: Classifiersmentioning
confidence: 99%
“…Convolutional neural networks apply small kernels in a sliding window across input sequences and have been shown to learn meaningful motif "features" from sequence data (Min et al, 2017). In fact, deep learning sequence based techniques have been applied to a variety of tasks including prediction of enhancers (Min et al, 2017), accessibility of DNA sequences (Kelley et al, 2016), the effect of non-coding variants (Zhou and Troyanskaya, 2015), sequence specificity of binding proteins (Alipanahi et al, 2015), and RNA protein coding potential (Hill et al, 2018). Often in these models, nucleotides are represented using one-hot encoded vectors.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…develop a model of mRNN based on Recurrent Neural Networks (RNN) , and they predict the coding potential of sORF (Hill, Kuintzle, Teegarden, Merrill, Danaee and Hendrix 2018). In November 2019, Zhu et al also develop a tool specifically for predicting micropeptides, MiPepid, which is a tool developed by using K-mer features and machine learning (Zhu and Gribskov 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Deep learning allows computational models composed of multiple processing layers to learn representations of data with multiple levels of abstraction 22 . Deep learning models can identify dependencies and complex structures in very highdimensional data -such as CLIP data -and have been used, for example, for predicting the effects of mutations in non-coding DNA on gene expression and disease 1,23 , predicting DNA function 24 , mRNA coding potential 25 , and prediction of subcellular locations of proteins 26 . Starting with DeepBind 27 , which uses convolutional neural networks (CNN) to classify bound sequences and nonbound, neural networks have also been used to directly model RBP preferences.…”
Section: Introductionmentioning
confidence: 99%