2018
DOI: 10.1007/978-3-319-89593-2_6
|View full text |Cite
|
Sign up to set email alerts
|

Scaffoldings and Spines: Organizing High-Dimensional Data Using Cover Trees, Local Principal Component Analysis, and Persistent Homology

Abstract: We propose a flexible and multi-scale method for organizing, visualizing, and understanding datasets sampled from or near stratified spaces. The first part of the algorithm produces a cover tree using adaptive thresholds based on a combination of multiscale local principal component analysis and topological data analysis. The resulting cover tree nodes consist of points within or near the same stratum of the stratified space. They are then connected to form a scaffolding graph, which is then simplified and col… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(14 citation statements)
references
References 18 publications
0
14
0
Order By: Relevance
“…One particularly useful tool for this analysis is 1-dimensional persistent homology [32,33], which encodes how circular structures persist over the course of a filtration in a topological signature called a persistence diagram. This and its variants have been quite successful in applications, particularly for the analysis of periodicity [34][35][36][37][38][39][40][41], including for parameter selection [42,43], data clustering [44], machining dynamics [45][46][47][48][49], gene regulatory systems [50,51], financial data [52][53][54], wheeze detection [55], sonar classification [56], video analysis [57][58][59], and annotation of song structure [60,61].…”
Section: Introductionmentioning
confidence: 99%
“…One particularly useful tool for this analysis is 1-dimensional persistent homology [32,33], which encodes how circular structures persist over the course of a filtration in a topological signature called a persistence diagram. This and its variants have been quite successful in applications, particularly for the analysis of periodicity [34][35][36][37][38][39][40][41], including for parameter selection [42,43], data clustering [44], machining dynamics [45][46][47][48][49], gene regulatory systems [50,51], financial data [52][53][54], wheeze detection [55], sonar classification [56], video analysis [57][58][59], and annotation of song structure [60,61].…”
Section: Introductionmentioning
confidence: 99%
“…Zigzag [43,44] and multidimensional [45] persistence are, for instance, promising methods for the analysis of temporal genomic data. Recent advances in dimensional reduction leveraging the modularity of topologically stratified spaces [46] will probably result in valuable tools for the analysis of genomic data. In summary, a rich interplay between formal developments and new applications is expected in upcoming years, which may place TDA in the standard toolbox of computational biology.…”
Section: Discussionmentioning
confidence: 99%
“…While the two notions are closely related, the emphasis of stratified spaces is topological. Bendich, Gasparovic, Tralie and Harer [9] used stratified spaces to develop a heuristic approach for partitioning the space. Their approach is both similar and complementary to the partitioning approach used in our methodology.…”
Section: Stratified Space Constructionmentioning
confidence: 99%