2016
DOI: 10.1371/journal.pone.0154567
|View full text |Cite
|
Sign up to set email alerts
|

LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature

Abstract: As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
43
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 55 publications
(44 citation statements)
references
References 77 publications
1
43
0
Order By: Relevance
“…Several computational tools have been developed to estimate the coding potential of novel identified transcripts. Among the most utilized CPC (Kong et al, 2007) and its updated version CPC2 (Kang et al, 2017), CPAT , COME (Hu et al, 2017), LncRNApred (Pian et al, 2016), PORTRAIT (Arrial et al, 2009), CONC, and others can be cited. The experimental approach is mainly based on the genomewide translatome that has been studied by ribosome footprinting, a technique introduced by Ingolia in 2009 (Ingolia et al, 2009).…”
Section: Lncrna-derived Micropeptidesmentioning
confidence: 99%
“…Several computational tools have been developed to estimate the coding potential of novel identified transcripts. Among the most utilized CPC (Kong et al, 2007) and its updated version CPC2 (Kang et al, 2017), CPAT , COME (Hu et al, 2017), LncRNApred (Pian et al, 2016), PORTRAIT (Arrial et al, 2009), CONC, and others can be cited. The experimental approach is mainly based on the genomewide translatome that has been studied by ribosome footprinting, a technique introduced by Ingolia in 2009 (Ingolia et al, 2009).…”
Section: Lncrna-derived Micropeptidesmentioning
confidence: 99%
“…Then, we compared several popular machine learning based methods, including Coding Potential Calculator (CPC) [28], Coding Potential Assessment Tool (CPAT) [29], Coding-Non-Coding Index (CNCI) [30], predictor of long noncoding RNAs and messenger RNAs based on an improved k -mer scheme (PLEK) [31], Long noncoding RNA IDentification (LncRNA-ID) [32], and lncRScan-SVM [33]. In addition, lncRNA-MFDL [34] and LncRNApred [35], two artificial neural network- (ANN-) involved tools, are also introduced in this paper. However, the provided access link of lncRNA-MFDL has been forbidden; LncRNApred often throws errors while handling massive-scale data which can be processed by CPC and CPAT successfully.…”
Section: Introductionmentioning
confidence: 99%
“…Coding Potential Assessment, which is also another popular approach, uses an alignment-free logistic regression model to determine the coding potential of RNAs [121]. To classify lncRNAs by machine learning techniques, there are CNCI [122], PLEK [123], LncRScan-SVM [124], lncRNA-MFDL [125], lncRNA-ID [126] and lncRNApred [127]. In human and mice, ISeeRNA [128], linc-SF [129] and DeepLNC [130] use machine learning techniques to categorize lncRNAs from transcriptome sequencing data [85].…”
Section: Bioinformatics Approachesmentioning
confidence: 99%