2018
DOI: 10.1261/rna.069112.118
|View full text |Cite
|
Sign up to set email alerts
|

Gene2vec: gene subsequence embedding for prediction of mammalian N6-methyladenosine sites from mRNA

Abstract: N 6 -Methyladenosine (m 6 A) refers to methylation modification of the adenosine nucleotide acid at the nitrogen-6 position. Many conventional computational methods for identifying N 6 -methyladenosine sites are limited by the small amount of data available. Taking advantage of the thousands of m 6 A sites detected by high-throughput sequencing, it is now possible to discover the characteristics of m 6 A sequences using deep learning techniques. To the best of our knowledge, our work is the first attempt to us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
229
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 449 publications
(232 citation statements)
references
References 68 publications
3
229
0
Order By: Relevance
“…Based on this consideration, we tried to describe the protein fragment with features as concise as possible to make the predictor simpler and more efficient. After a series of experiments, we selected two types of encoding schemes to represent the protein fragment: one-hot code [28][29][30] and a position-specific scoring matrix (PSSM), where the former one encodes the residues arrangement and the latter one reflect the evolutionary profile. However, reducing the number of features will inevitably result in less information that may get by the predictor and cause performance deterioration.…”
Section: Methodsmentioning
confidence: 99%
“…Based on this consideration, we tried to describe the protein fragment with features as concise as possible to make the predictor simpler and more efficient. After a series of experiments, we selected two types of encoding schemes to represent the protein fragment: one-hot code [28][29][30] and a position-specific scoring matrix (PSSM), where the former one encodes the residues arrangement and the latter one reflect the evolutionary profile. However, reducing the number of features will inevitably result in less information that may get by the predictor and cause performance deterioration.…”
Section: Methodsmentioning
confidence: 99%
“…As a representative branch of deep learning, CNN [34] had already made great achievements in various research fields, including protein sequence studies [35][36][37][38][39]. It can capture various nonlinear features by constructing neural networks consisting of convolution, pooling, and fully connected layers.…”
Section: Deep Learning Networkmentioning
confidence: 99%
“…As an alternative to costly and labor-intensive laboratory experiments, robust, swift, and inexpensive computational methods for RNA chemical modification prediction have emerged recently, owing to the increasing amount of data generated in this post-genomics era (Libbrecht and Noble, 2015). A large number of m6A (Chen et al, 2015(Chen et al, , 2018a(Chen et al, ,b, 2019aZhou et al, 2016;Zhao et al, 2019;Zou et al, 2019) and m5C (Feng et al, 2016;Qiu et al, 2017;Li et al, 2018;Sabooh et al, 2018;Zhang et al, 2018;Yin et al, 2019) site predictors based on traditional machine learning and emerging deep learning algorithms have been proposed. However, few computational tools have been developed to predict pseudouridine sites.…”
Section: Introductionmentioning
confidence: 99%