2018
DOI: 10.1093/bib/bby037
|View full text |Cite
|
Sign up to set email alerts
|

Predicting novel microRNA: a comprehensive comparison of machine learning approaches

Abstract: This review provides a comprehensive study and comparative assessment of methods from these two ML approaches for dealing with the prediction of novel pre-miRNAs: supervised and unsupervised training. We present and analyze the ML proposals that have appeared during the past 10 years in literature. They have been compared in several prediction tasks involving two model genomes and increasing imbalance levels. This work provides a review of existing ML approaches for pre-miRNA prediction and fair comparisons of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
33
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
2

Relationship

4
6

Authors

Journals

citations
Cited by 38 publications
(33 citation statements)
references
References 60 publications
0
33
0
Order By: Relevance
“…However, to the best of our knowledge, there are no such datasets available. Actually, in most published works, the datasets used for training and testing the prediction methods are manually built, use diverse methodologies according to each study [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], and require a (not negligible) long time. Secondly, it is very hard to fairly compare among different classifiers.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
“…However, to the best of our knowledge, there are no such datasets available. Actually, in most published works, the datasets used for training and testing the prediction methods are manually built, use diverse methodologies according to each study [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], and require a (not negligible) long time. Secondly, it is very hard to fairly compare among different classifiers.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
“…The first stage is data set preparation. Candidate peptide sequences were typically collected from a validated database [44]. To build a high-quality predictive model, training and test sets were required.…”
Section: A Framework Of the Proposed Cpp Predictormentioning
confidence: 99%
“…This high imbalance is very difficult to handle by any ML model (Bugnon et al, 2019). In spite of the fact that there is a myriad of methods for pre-miRNA classification published (Li et al, 2010;Gomes and et al, 2013;Shukla et al, 2017;Stegmayer et al, 2018;Chen et al, 2019), a study has clearly shown that the prediction of pre-miRNAs is yet far-away from being satisfactory solved because existing methods have a very high rate of false positives (Demirci et al, 2017). That is, they provide an excessively long list of candi-dates to novel pre-miRNAs, that cannot be validated with wet-experiments.…”
Section: Introductionmentioning
confidence: 99%