2017
DOI: 10.1186/s12859-017-1839-x
|View full text |Cite
|
Sign up to set email alerts
|

Fangorn Forest (F2): a machine learning approach to classify genes and genera in the family Geminiviridae

Abstract: BackgroundGeminiviruses infect a broad range of cultivated and non-cultivated plants, causing significant economic losses worldwide. The studies of the diversity of species, taxonomy, mechanisms of evolution, geographic distribution, and mechanisms of interaction of these pathogens with the host have greatly increased in recent years. Furthermore, the use of rolling circle amplification (RCA) and advanced metagenomics approaches have enabled the elucidation of viromes and the identification of many viral agent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 58 publications
0
5
0
Order By: Relevance
“…An excellent effort by Silva and collaborators have developed Fangorn Forest, a ML based method, for classification of geminiviruses. Among the three tested algorithms, random forest (RF) has proven to be best in classification of genes and genera of this largest plant virus family [ 53 ]. Recently, a CNN guided sequencing platform has successfully completed human genome sequencing within couple of hours and efficiently identified the disease-causing variations in the genome.…”
Section: Main Textmentioning
confidence: 99%
“…An excellent effort by Silva and collaborators have developed Fangorn Forest, a ML based method, for classification of geminiviruses. Among the three tested algorithms, random forest (RF) has proven to be best in classification of genes and genera of this largest plant virus family [ 53 ]. Recently, a CNN guided sequencing platform has successfully completed human genome sequencing within couple of hours and efficiently identified the disease-causing variations in the genome.…”
Section: Main Textmentioning
confidence: 99%
“…The prediction model consists of three steps subsequently built with trained models and different algorithms capable of distinguishing RLP from NRLP, RLP from RLKs, and finally, predicting an RLP subfamily. The combination of several ML models with different algorithms has been applied for protein and viral sequence classification [ 58 , 63 ]. Using different classifiers requires methods that compile the results of the classifiers into a single final prediction.…”
Section: Discussionmentioning
confidence: 99%
“…ML has been extensively used in all sorts of thematic issues, from medicine to robotics [ 55 , 56 , 57 ]. In plant science, ML has been applied for viral gene identification [ 58 ], the diagnosis of bacterial infection [ 59 ], salt stress tolerance [ 60 ], and the taxonomy of grapevine [ 61 ], in addition to global analysis of gene expression, in response to hormones and environmental stresses [ 62 ], plant immunity, and miRNA network prediction [ 54 ]. Trained models have also been successfully used for functional protein classification in plant genomes [ 63 ].…”
Section: Introductionmentioning
confidence: 99%
“…Many different computational methods or algorithms are used to classify geminiviruses only based on the genomic information. For example, Silva et al (2017b) presented a machine learning (ML) classification model, called Fangorn Forest (F2), based on only genomic characteristics to classify genera and genes in the Geminiviridae family. All genera of the family Geminiviridae could be classified with high accuracy (Silva et al, 2017b).…”
Section: Introductionmentioning
confidence: 99%
“…(2017b) presented a machine learning (ML) classification model, called Fangorn Forest (F2), based on only genomic characteristics to classify genera and genes in the Geminiviridae family. All genera of the family Geminiviridae could be classified with high accuracy ( Silva et al, 2017b ) . However, these methods cannot classify the genomic components of begomoviruses.…”
Section: Introductionmentioning
confidence: 99%