2020
DOI: 10.3389/fgene.2020.620143
|View full text |Cite
|
Sign up to set email alerts
|

A t-SNE Based Classification Approach to Compositional Microbiome Data

Abstract: As a data-driven dimensionality reduction and visualization tool, t-distributed stochastic neighborhood embedding (t-SNE) has been successfully applied to a variety of fields. In recent years, it has also received increasing attention for classification and regression analysis. This study presented a t-SNE based classification approach for compositional microbiome data, which enabled us to build classifiers and classify new samples in the reduced dimensional space produced by t-SNE. The Aitchison distance was … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
23
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(24 citation statements)
references
References 33 publications
1
23
0
Order By: Relevance
“…To compare microbial communities among groups, beta-diversity was calculated to reflect the dissimilarity between groups, including unweighted unifrac [ 44 ] and weighted unifrac distance [ 45 ]. The data in this distance matrix were further visualized by t-SNE (t-distributed stochastic neighborhood embedding), a nonlinear dimension reduction method [ 46 ], To define potential biomarkers with statistical differences among groups, metagenomic features were detected by linear discriminant analysis (LDA) effect size (LEfSe) [ 47 ]. In order to emphasize the difference of the dominant species among three samples in each taxonomic rank, the top 10 species were selected and the ternary plot was drawn based on relative abundance [ 48 ].…”
Section: Methodsmentioning
confidence: 99%
“…To compare microbial communities among groups, beta-diversity was calculated to reflect the dissimilarity between groups, including unweighted unifrac [ 44 ] and weighted unifrac distance [ 45 ]. The data in this distance matrix were further visualized by t-SNE (t-distributed stochastic neighborhood embedding), a nonlinear dimension reduction method [ 46 ], To define potential biomarkers with statistical differences among groups, metagenomic features were detected by linear discriminant analysis (LDA) effect size (LEfSe) [ 47 ]. In order to emphasize the difference of the dominant species among three samples in each taxonomic rank, the top 10 species were selected and the ternary plot was drawn based on relative abundance [ 48 ].…”
Section: Methodsmentioning
confidence: 99%
“…It plots the similar data points on the same map as close as possible. Therefore, tSNE is often used for microbiome analysis as a popular new ordination technique [21][22][23][24].…”
Section: Discussionmentioning
confidence: 99%
“…Kaplan–Meier curves and receiver operating characteristics (ROC) curves were performed to assess the sensitivity and specificity of NRGsig. In addition, investigation was performed using principal component analysis (PCA) and T-distributed neighbor embedding (T-SNE) to analyze whether the prognostic model might properly categorize patients into two risk groups 25 . In addition, the GSE84437 cohort was used as an external validation set to confirm the model’s predictive value.…”
Section: Methodsmentioning
confidence: 99%