2019
DOI: 10.1101/815480
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The effect of variant interference on de novo assembly for viral deep sequencing

Abstract: Viruses have high mutation rates and generally exist as a mixture of variants in biological samples. Next-generation sequencing (NGS) approach has surpassed Sanger for generating long viral sequences, yet how variants affect NGS de novo assembly remains largely unexplored. Our results from >15,000 simulated experiments showed that presence of variants can turn an assembly of one genome into tens to thousands of contigs. This “variant interference” (VI) is highly consistent and reproducible by ten most used … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 46 publications
0
2
0
Order By: Relevance
“…Standardized bioinformatic pipelines for uncovering viruses via metagenomics are being developed, however assembling eukaryotic viral genomes from uncultured, multicellular, host-based systems is still an uncommon practice (Castro et al, 2019;Ponsero and Hurwitz, 2019;Schulz et al, 2020a). For our dataset, some techniques, such as metagenomic binning using nucleotide composition, contig abundance, and reference genomes (Schulz et al, 2020b), did not work for identifying viral contigs.…”
Section: Metagenomics and Uvig Analysis Suggests Corals Contain A Novmentioning
confidence: 97%
“…Standardized bioinformatic pipelines for uncovering viruses via metagenomics are being developed, however assembling eukaryotic viral genomes from uncultured, multicellular, host-based systems is still an uncommon practice (Castro et al, 2019;Ponsero and Hurwitz, 2019;Schulz et al, 2020a). For our dataset, some techniques, such as metagenomic binning using nucleotide composition, contig abundance, and reference genomes (Schulz et al, 2020b), did not work for identifying viral contigs.…”
Section: Metagenomics and Uvig Analysis Suggests Corals Contain A Novmentioning
confidence: 97%
“…Aside from the upstream viral enrichment process, there are several barriers in downstream bioinformatics analysis that limit viral community identification and characterization. High viral complexity and diversity in the environmental samples, particularly sewage samples, often lead to nonuniform sequencing depth resulting in loss of rare viral species signals and poor quality of viral genome recovery [26–28]. Recently, flow cytometry has been coupled with viral metagenomics in marine samples to discover tailed and very large viruses, which could be excluded in routine viral metagenomics [29–31].…”
Section: Introductionmentioning
confidence: 99%