2016
DOI: 10.3389/fmicb.2016.00822
|View full text |Cite
|
Sign up to set email alerts
|

HoloVir: A Workflow for Investigating the Diversity and Function of Viruses in Invertebrate Holobionts

Abstract: Abundant bioinformatics resources are available for the study of complex microbial metagenomes, however their utility in viral metagenomics is limited. HoloVir is a robust and flexible data analysis pipeline that provides an optimized and validated workflow for taxonomic and functional characterization of viral metagenomes derived from invertebrate holobionts. Simulated viral metagenomes comprising varying levels of viral diversity and abundance were used to determine the optimal assembly and gene prediction s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
62
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 47 publications
(62 citation statements)
references
References 84 publications
(115 reference statements)
0
62
0
Order By: Relevance
“…This degree of contamination was not evident when screening the reads against the complete SILVA database: 0.06% of the reads from the raw dataset and 0.1% of the reads from the filtered dataset mapped to rRNA genes at min bit-score 80, indicating negligible amounts of cellular contamination as defined by Roux et al (2013). Furthermore, at max 1.0E-10, only 577 of the 33296 predicted proteins (1.6%) mapped to cellular protein sequences from the extensive marker gene database published by Laffy et al (2016). …”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…This degree of contamination was not evident when screening the reads against the complete SILVA database: 0.06% of the reads from the raw dataset and 0.1% of the reads from the filtered dataset mapped to rRNA genes at min bit-score 80, indicating negligible amounts of cellular contamination as defined by Roux et al (2013). Furthermore, at max 1.0E-10, only 577 of the 33296 predicted proteins (1.6%) mapped to cellular protein sequences from the extensive marker gene database published by Laffy et al (2016). …”
Section: Resultsmentioning
confidence: 99%
“…Previous studies with simulated datasets had not addressed these particular data issues. To identify a suitable assembly program, we simulated datasets that mimicked the copper-site data challenges, and compared performance of five assemblers that had previously shown best results in metagenome reconstruction (de Cárcer et al, 2014; Vázquez-Castellanos et al, 2014; García-López et al, 2015; Laffy et al, 2016). Of particular interest was the assembly precision for different size sequences, which are expected in metagenomes.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations