2016
DOI: 10.1101/036392
Preprint

Long single-molecule reads can resolve the complexity of the Influenza virus composed of rare, closely related mutant variants

Abstract: As a result of a high rate of mutations and recombination events, an RNA-virus exists as a heterogeneous "swarm" of mutant variants. The long read length offered by single-molecule sequencing technologies allows each mutant variant to be sequenced in a single pass. However, high error rate limits the ability to reconstruct heterogeneous viral population composed of rare, related mutant variants. In this paper, we present 2SNV, a method able to tolerate the high error-rate of the single-molecule proto…

Citations: cited by 8 publications (12 citation statements)
References: 44 publications
“…1: Finding linked SNV pairs. CliqueSNV uses pairs of linked SNVs which have been previously introduced for the 2SNV method [3]. Let the major (minor) allele at a given genomic position be the allele observed in the majority (minority) of reads covering this position.…”
Section: CliqueSNV Algorithm (mentioning)
confidence: 99%
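The definition of major and minor alleles quoted above can be illustrated with a minimal sketch. It is not the CliqueSNV or 2SNV implementation. It assumes the reads are already aligned to a common coordinate system and given as strings, with `-` marking positions a read does not cover. The function names `major_minor_alleles` and `count_minor_pair` are hypothetical, and treating the second most frequent allele as the minor allele is an assumption for positions with more than two alleles.

```python
from collections import Counter


def major_minor_alleles(reads, gap="-"):
    """Return one (major, minor) tuple per position.

    Assumption: reads are pre-aligned strings; `gap` marks uncovered positions.
    The minor allele is taken to be the second most frequent allele
    (None when only one allele is observed at the position).
    """
    length = max(len(read) for read in reads)
    alleles = []
    for pos in range(length):
        # Count alleles only among reads that actually cover this position.
        counts = Counter(
            read[pos] for read in reads if pos < len(read) and read[pos] != gap
        )
        ranked = counts.most_common()
        major = ranked[0][0] if ranked else None
        minor = ranked[1][0] if len(ranked) > 1 else None
        alleles.append((major, minor))
    return alleles


def count_minor_pair(reads, i, j, alleles):
    """Count reads carrying the minor allele at both positions i and j."""
    minor_i, minor_j = alleles[i][1], alleles[j][1]
    if minor_i is None or minor_j is None:
        return 0
    return sum(
        1
        for read in reads
        if max(i, j) < len(read) and read[i] == minor_i and read[j] == minor_j
    )


reads = ["ACGT", "ACGT", "ATGA", "ACGT", "ATGA"]
alleles = major_minor_alleles(reads)
pair_support = count_minor_pair(reads, 1, 3, alleles)
```

In this toy input, positions 1 and 3 each have a minor allele, and `pair_support` is the number of reads that carry both minor alleles together, which is the kind of co-occurrence count a linked-pair test would start from.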
“…is the largest probability of observing the 2-haplotypes (22) among these n reads given that the variant (22) does not exist. It has been shown in [3] that after Bonferroni correction to multiple testing the value of p should satisfy the following inequality…”
Section: CliqueSNV Algorithm (mentioning)
confidence: 99%
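The inequality from [3] is truncated in the excerpt above and is not reconstructed here. As general background only, the sketch below shows the textbook Bonferroni correction: with `num_tests` hypotheses tested at family-wise level `alpha`, each individual p-value is compared against `alpha / num_tests`. The function names, and the choice of one test per pair of genomic positions, are assumptions made for illustration.

```python
from math import comb


def bonferroni_threshold(alpha, num_tests):
    """Per-test significance threshold under the textbook Bonferroni correction."""
    if num_tests < 1:
        raise ValueError("num_tests must be at least 1")
    return alpha / num_tests


def is_significant(p_value, alpha, num_tests):
    """True if p_value passes the Bonferroni-corrected threshold."""
    return p_value <= bonferroni_threshold(alpha, num_tests)


# Assumption for illustration: one test per unordered pair of 10 positions.
num_pairs = comb(10, 2)
threshold = bonferroni_threshold(0.05, num_pairs)
```

The correction grows stricter as the number of tested pairs increases, which is why a long genomic region demands much stronger evidence per pair than a short one.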
“…In these cases, instead of obtaining the golden standard, one can design a mock community (often referred as a synthetic mock community) by combining in vitro titrated proportions of community elements. The most popular mock communities are prepared as mixtures of known microbial organisms [31, 32]. When microbial organisms are closely related with similar sequences, such as intra-host RNA-virus populations, one should challenge computational methods with various frequency profiles and include closely related pairs [19, 32-34].…”
Section: Defining Gold Standards (mentioning)
confidence: 99%