2020
DOI: 10.1101/2020.09.04.282665
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

De novo virus inference and host prediction from metagenome using CRISPR spacers

Abstract: Viruses are the most numerous biological entity, existing in all environments and infecting all cellular organisms. Compared with cellular life, the evolution and origin of viruses are poorly understood; viruses are enormously diverse and most lack sequence similarity to cellular genes. To uncover viral sequences without relying on either reference viral sequences from databases or marker genes known to characterize specific viral taxa, we developed a virus inference method based on clustered regularly intersp… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 60 publications
(37 reference statements)
0
2
0
Order By: Relevance
“…Next, we tried to infer whether these putative phages were predicted to be active or not by comparing them with databases of CRISPR spacers derived from human gut metagenomes. To this end, we used a publicly available collection of spacers extracted from 11, 817 human gut metagenome datasets (64) and an in-house spacer database that was built by running CRISPRCasFinder (65) on the Integrative Human Microbiome Project – Inflammatory Bowel Disease metagenomic dataset (66) (Methods). By doing this, we obtained that 1346 out of 1447 dereplicated viral genomes harboring UG27 (93, 02%) were targeted by at least 1 spacer, with 1197 (82%) being targeted by 5 or more spacers, suggesting a recent active role of these viral genomes in their natural environment.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Next, we tried to infer whether these putative phages were predicted to be active or not by comparing them with databases of CRISPR spacers derived from human gut metagenomes. To this end, we used a publicly available collection of spacers extracted from 11, 817 human gut metagenome datasets (64) and an in-house spacer database that was built by running CRISPRCasFinder (65) on the Integrative Human Microbiome Project – Inflammatory Bowel Disease metagenomic dataset (66) (Methods). By doing this, we obtained that 1346 out of 1447 dereplicated viral genomes harboring UG27 (93, 02%) were targeted by at least 1 spacer, with 1197 (82%) being targeted by 5 or more spacers, suggesting a recent active role of these viral genomes in their natural environment.…”
Section: Resultsmentioning
confidence: 99%
“…Host taxonomy prediction of UG27 viral genomes was done by performing CRISPR spacers comparisons against a reference database using CrisprOpenDB (63). As most of the taxonomy predictions pointed out towards bacteria present in the gut microbiome, viral genomes were compared against recent human gut metagenomic spacers database (64) and a custom collection of spacers found in CRISPR arrays from the human gut microbiome; compiled by running CRISPRCasFinder (65) on all metagenomic contigs from the HMP2-IBD database (66). For the comparison, BLASTn (67) with -task “blastn-short” option was used, and only hits with ≤2 mismatches and >95% sequence identity were kept.…”
Section: Methodsmentioning
confidence: 99%