2019
DOI: 10.1186/s12864-019-5490-y
|View full text |Cite
|
Sign up to set email alerts
|

Signal enrichment with strain-level resolution in metagenomes using topological data analysis

Abstract: Background A metagenome is a collection of genomes, usually in a micro-environment, and sequencing a metagenomic sample en masse is a powerful means for investigating the community of the constituent microorganisms. One of the challenges is in distinguishing between similar organisms due to rampant multiple possible assignments of sequencing reads, resulting in false positive identifications. We map the problem to a topological data analysis (TDA) framework that extracts… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…TDA and Applications TDA has several key advantages over traditional data analysis and dimensionality reduction techniques such as PCA, multi-dimensional scaling, and cluster analysis, owing to its robustness, visualization capabilities, and coordinate-free properties [15]. Since initial work on the Mapper framework by Singh et al [19], TDA has gained wider use with a demonstrated ability to glean insightful structural information in the context of many applications such as cancer [14], microbial ecology [8], wildfires [2], agricultural phenomics [12], and more. To the best of our knowledge, our effort represents one of the first to adopt TDA for epidemics.…”
Section: Related Workmentioning
confidence: 99%
“…TDA and Applications TDA has several key advantages over traditional data analysis and dimensionality reduction techniques such as PCA, multi-dimensional scaling, and cluster analysis, owing to its robustness, visualization capabilities, and coordinate-free properties [15]. Since initial work on the Mapper framework by Singh et al [19], TDA has gained wider use with a demonstrated ability to glean insightful structural information in the context of many applications such as cancer [14], microbial ecology [8], wildfires [2], agricultural phenomics [12], and more. To the best of our knowledge, our effort represents one of the first to adopt TDA for epidemics.…”
Section: Related Workmentioning
confidence: 99%
“…The first important contribution of the current paper is that we prove concrete stability results for harmonic persistence homology in the context of general filtrations of simplicial complexes (without any underlying assumption that the simplicial complex is approximating some Riemannian manifold). This is very important in applications (such as in genomics) where persistent homology methods are applied to "relationship" data as opposed to point-cloud data from some R n (see [27] and also [25,21,22,28] for recent representative examples of such applications). The stability results proved in this paper allow us to infer that harmonic persistence theory is applicable when topological data analysis is used outside the context of manifold learning (such as in genomics).…”
Section: Stabilitymentioning
confidence: 99%