2020
DOI: 10.1101/2020.12.02.408310
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Prokaryotic virus Host Predictor: a Gaussian model for host prediction of prokaryotic viruses in metagenomics

Abstract: BackgroundViruses are ubiquitous biological entities, estimated to be the largest reservoirs of unexplored genetic diversity on Earth. Full functional characterization and annotation of newly-discovered viruses requires tools to enable taxonomic assignment, the range of hosts, and biological properties of the virus. Here we focus on prokaryotic viruses, which include phages and archaeal viruses, and for which identifying the viral host is an essential step in characterizing the virus, as the virus relies on th… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(21 citation statements)
references
References 28 publications
(25 reference statements)
0
21
0
Order By: Relevance
“…In this section, we will show our experimental results on different datasets and compare CHERRY against the stateof-the-art tools: WIsH [10], PHP [24], VHM-Net [12], VPF-Class [13], vHULK [19], RaFAH [21], and HostG [27].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this section, we will show our experimental results on different datasets and compare CHERRY against the stateof-the-art tools: WIsH [10], PHP [24], VHM-Net [12], VPF-Class [13], vHULK [19], RaFAH [21], and HostG [27].…”
Section: Resultsmentioning
confidence: 99%
“…They also use a deep learning model and finally integrate the results of deep learning model with the Markov model for host prediction. On the other hand, PHP [24] utilizes the k-mer frequency, which can reflect the codon usage patterns shared by the viruses and the hosts [25,26]. HostG [27] utilizes the shared protein clusters between viruses and prokaryotes to create a knowledge graph and trains a graph convolutional network for prediction.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Several attempts have been made to predict hosts for phages based on the genomic sequences. They can be roughly divided into two groups: alignment-based [15,16] and learning-based [10,17,18] models. Alignment-based methods utilize sequence similarity search between query contigs and reference genomes of candidate hosts (bacteria).…”
Section: Related Workmentioning
confidence: 99%
“…First, both lytic and temperate phages can integrate the host genetic materials into their genomes, leading to local sequence similarities between the genomes of phages and bacteria [13]. For example, ~76% phages with known hosts in the RefSeq database have detectable alignments (E-value <1e-5) with their host genomes [14]. These common regions will pose challenges for distinguishing phages from their bacterial hosts.…”
Section: Introductionmentioning
confidence: 99%