2021
DOI: 10.1021/acs.jpcb.1c02081
|View full text |Cite
|
Sign up to set email alerts
|

UMAP as a Dimensionality Reduction Tool for Molecular Dynamics Simulations of Biomacromolecules: A Comparison Study

Abstract: Proteins are the molecular machines of life. The multitude of possible conformations that proteins can adopt determines their free-energy landscapes. However, the inherently high dimensionality of a protein free-energy landscape poses a challenge to deciphering how proteins perform their functions. For this reason, dimensionality reduction is an active field of research for molecular biologists. The uniform manifold approximation and projection (UMAP) is a dimensionality reduction method based on a fuzzy topol… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
57
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 64 publications
(58 citation statements)
references
References 64 publications
1
57
0
Order By: Relevance
“…The top 1500 dispersed genes were used to perform PCA using the RunPCA function. To reduce the dimension, we chose UMAP (Uniform Manifold Approximation and Projection), which has a similar visualization quality to the tSNE (t-Distributed Stochastic Neighbor Embedding) but may preserve more of the global structure [ 26 ]. Through UMAP analyses, the dimension of the first 25 principal components was reduced into 2 components using the RunUMAP function with the parameters n.neighbors = 50 and min.dist = 0.1.…”
Section: Methodsmentioning
confidence: 99%
“…The top 1500 dispersed genes were used to perform PCA using the RunPCA function. To reduce the dimension, we chose UMAP (Uniform Manifold Approximation and Projection), which has a similar visualization quality to the tSNE (t-Distributed Stochastic Neighbor Embedding) but may preserve more of the global structure [ 26 ]. Through UMAP analyses, the dimension of the first 25 principal components was reduced into 2 components using the RunUMAP function with the parameters n.neighbors = 50 and min.dist = 0.1.…”
Section: Methodsmentioning
confidence: 99%
“…The correlation-based metrics have been widely applied in the comparison between dimensionality reduction methods for biomolecules ( Tian and Tao, 2020 ; Trozzi et al, 2021 ). They are used here for the encoder module to measure how well the information is preserved in the latent space.…”
Section: Methodsmentioning
confidence: 99%
“…To elucidate the biological phenotype regulated by the NMRGs in the blood of ALS patients, the unsupervised consensus clustering (R package: ConsensuClusterPlus) (Wilkerson and Hayes, 2010) was used to obtain clusters based on "pam" method with 1,000 iterations and resample rate of 80%. Principal component analysis (PCA) and uniform manifold approximation and projection for dimension reduction (UMAP) (Trozzi et al, 2021) methods were used to evaluate gene expression patterns in peripheral blood of different subclusters of ALS. The GSEA software version: 4.1.0 (https://www.gsea-msigdb.org/gsea/index.…”
Section: Additional Bioinformatic Analysesmentioning
confidence: 99%