2021
DOI: 10.1099/mgen.0.000607
|View full text |Cite
|
Sign up to set email alerts
|

SplitStrains, a tool to identify and separate mixed Mycobacterium tuberculosis infections from WGS data

Abstract: The occurrence of multiple strains of a bacterial pathogen such as M. tuberculosis or C. difficile within a single human host, referred to as a mixed infection, has important implications for both healthcare and public health. However, methods for detecting it, and especially determining the proportion and identities of the underlying st… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(19 citation statements)
references
References 39 publications
0
19
0
Order By: Relevance
“…We compared the accuracy of our new tool, MixInfect2, against previous methods ( MixInfect 6 , SplitStrains 16 and QuantTB 17 ) for detecting mixed infections and to estimate the major strain proportion from the dataset of 36 in vitro mixed samples and 12 non-mixed (‘pure’) strains. The average coverage in all these samples was relatively high, ranging from 356- to 482-fold.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We compared the accuracy of our new tool, MixInfect2, against previous methods ( MixInfect 6 , SplitStrains 16 and QuantTB 17 ) for detecting mixed infections and to estimate the major strain proportion from the dataset of 36 in vitro mixed samples and 12 non-mixed (‘pure’) strains. The average coverage in all these samples was relatively high, ranging from 356- to 482-fold.…”
Section: Resultsmentioning
confidence: 99%
“…Here, we present a new tool for identifying multi- strain Mtb samples from WGS data, MixInfect2, that builds upon a previously a published method 6 to improve the accuracy of mixed infection detection. We compare this new method to three previous tools for identifying mixed infection (MixInfect 6 , SplitStrains 16 , and QuantTB 17 ) and then assess different approaches for reconstructing the constituent strain sequences of the 36 in vitro mixed samples at varying major and minor strain proportions. Finally, we apply the optimal approach to a real-world dataset of over 2,000 Mtb isolates collected from the Republic of Moldova between 2018 - 2019 to predict the constituent strain sequences of mixed samples and place these strains into putative transmission clusters.…”
Section: Introductionmentioning
confidence: 99%
“…To mimic the benchmark dataset used to evaluate QuantTB, we generated two standard datasets (ART-TBmix1 and ART-TBmix2) as per the simulation procedure described in [14][12] using ART simulator [33]. Both ART-TBmix1 and ART-TBmix2 consists of 800 samples at four distinct levels of coverage: 10x, 20x, 90x-10x, and 70x-30x, each with 200 samples.…”
Section: Methodsmentioning
confidence: 99%
“…Although this method is simple and does not require any overhead of keeping track of the new strains, it reveals only the strains’ proportions and not their identity (mutational profile), and also cannot robustly handle cases when strains are present in equal proportions. SplitStrains [14] is a new statistical-based method that aligns with MixInfect due to its non-reliance on the reference database. The procedure uses a likelihood ratio test to resolve the heterogeneity of the samples, followed by the assignment of reads to a strain using the Expectation-Maximization (EM) algorithm and Naive Bayes classifier, in order to finally report the strain proportions.…”
Section: Introductionmentioning
confidence: 99%
“…This technique can be applied for genotypic and phenotypic characterization of organisms and profiling of drug susceptibility [82,83] including detecting mutations in new drugs such as bedaquiline and delamanid [84]. The performance of WGS technology is further enhanced by incorporating a novel method 'SplitStrains' which helps to analyze WGS data of patients having mixed infections [85]. Due to high cost, the need of robust technologies, and technical expertise, initially, there was limited WGS utility in low-income countries [86].…”
Section: Whole Genome Sequencing (Wgs)mentioning
confidence: 99%