2019
DOI: 10.1038/s41598-019-39847-2
|View full text |Cite
|
Sign up to set email alerts
|

Host Taxon Predictor - A Tool for Predicting Taxon of the Host of a Newly Discovered Virus

Abstract: Recent advances in metagenomics provided a valuable alternative to culture-based approaches for better sampling viral diversity. However, some of newly identified viruses lack sequence similarity to any of previously sequenced ones, and cannot be easily assigned to their hosts. Here we present a bioinformatic approach to this problem. We developed classifiers capable of distinguishing eukaryotic viruses from the phages achieving almost 95% prediction accuracy. The classifiers are wrapped in Host Taxon Predicto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(30 citation statements)
references
References 58 publications
0
30
0
Order By: Relevance
“…Two recent studies employ k-mer based, k-NN classifiers (16) and deep learning (17) to predict host range for a small set of three wellstudied species directly from viral sequences. While those approaches are limited to those particular species and do not scale to viral host-range prediction in general, the Host Taxon Predictor (HTP) (18) uses logistic regression and support vector machines to predict if a novel virus infects bacteria, plants, vertebrates or arthropods. Yet, the authors argue that it is not possible to use HTP in a read-based manner; it requires long sequences of at least 3,000 nucleotides.…”
Section: Current Tools For Host Range Predictionmentioning
confidence: 99%
“…Two recent studies employ k-mer based, k-NN classifiers (16) and deep learning (17) to predict host range for a small set of three wellstudied species directly from viral sequences. While those approaches are limited to those particular species and do not scale to viral host-range prediction in general, the Host Taxon Predictor (HTP) (18) uses logistic regression and support vector machines to predict if a novel virus infects bacteria, plants, vertebrates or arthropods. Yet, the authors argue that it is not possible to use HTP in a read-based manner; it requires long sequences of at least 3,000 nucleotides.…”
Section: Current Tools For Host Range Predictionmentioning
confidence: 99%
“…Two recent studies employ k -mer based, k -NN classifiers ( 16 ), and deep learning ( 17 ) to predict host range for a small set of three well-studied species directly from viral sequences. While those approaches are limited to those particular species and do not scale to viral host-range prediction in general, the Host Taxon Predictor (HTP) ( 18 ) uses logistic regression and supports vector machines to predict if a novel virus infects bacteria, plants, vertebrates or arthropods. Yet, the authors argue that it is not possible to use HTP in a read-based manner; it requires long sequences of at least 3000 nucleotides.…”
Section: Introductionmentioning
confidence: 99%
“…However, DeePhage cannot identify these sequences before distinguishing the lifestyle of each contig. Fortunately, the related tool that helps to distinguish prokaryotic and eukaryotic viruses has been developed recently [48] and we are also considering constructing a preprocessing module for DeePhage to filter out the eukaryotic viruses so that DeePhage can generate more reliable results for the downstream analysis.…”
Section: Discussionmentioning
confidence: 99%