2022
DOI: 10.1021/acs.jctc.1c01290
|View full text |Cite
|
Sign up to set email alerts
|

Size-and-Shape Space Gaussian Mixture Models for Structural Clustering of Molecular Dynamics Trajectories

Abstract: Determining the optimal number and identity of structural clusters from an ensemble of molecular configurations continues to be a challenge. Recent structural clustering methods have focused on the use of internal coordinates due to the innate rotational and translational invariance of these features. The vast number of possible internal coordinates necessitates a feature space supervision step to make clustering tractable but yields a protocol that can be system type-specific. Particle positions offer an appe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
62
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 33 publications
(64 citation statements)
references
References 58 publications
2
62
0
Order By: Relevance
“…In Fig. 4(c), we show the replacement of U with its average U (obtained over the averaging interval of [20,25] ns). We observe that MSM dynamics predicted using only 25 ns of the MD data set overestimates the equilibration rate, as was the case with alanine dipeptide and the argonaute complex, whereas the U -GME parameterized with the same amount of reference data accurately captures the MD data until ∼ 375 ns.…”
Section: Fip35 Ww-domainmentioning
confidence: 99%
See 2 more Smart Citations
“…In Fig. 4(c), we show the replacement of U with its average U (obtained over the averaging interval of [20,25] ns). We observe that MSM dynamics predicted using only 25 ns of the MD data set overestimates the equilibration rate, as was the case with alanine dipeptide and the argonaute complex, whereas the U -GME parameterized with the same amount of reference data accurately captures the MD data until ∼ 375 ns.…”
Section: Fip35 Ww-domainmentioning
confidence: 99%
“…As we mentioned in the Introduction, below we do not consider how one identifies these configurational basins (the interested reader can see, for instance, Refs. 1821, 2325, and 39), but rather focus on the second problem: given a set of configurations whose dynamics one can only afford for only short times, how does one construct a dynamical framework to accurately and efficiently capture the dynamics of these configurations over all time?…”
Section: Connecting Markovian and Non-markovian Evolutionmentioning
confidence: 99%
See 1 more Smart Citation
“…While work in the last few decades has produced a robust set of tools to exploit collections of short-time all-atom MD simulations of complex biomolecular systems to identify these elusive and important slow coordinates, this problem of identifying slow degrees of freedom remains an open question of fundamental importance to many fields. 11,[41][42][43][44][45][46][47][48][49] Once these slow coordinates are (perfectly or approximately) identified, one can employ a variety of tools to estimate the transition probabilities 11,42 and construct the simple equation of motion -which takes the form of a simple chemical kinetics problem -for the interconversion of these slow macrostates at the heart of the MSM: 10,42…”
Section: Generalized Master Equations: the Advantages Of Memorymentioning
confidence: 99%
“…Theoretically well-grounded dimensionality reduction (DR) techniques are now commonly being used in protein conformation analysis to extract the latent low dimensional features and the quantum of information lost during the projection depends heavily on the kind of data set under consideration 6772 . For example, a highly heterogeneous data set that lies on a high-dimensional manifold as in the case of IDPs is best handled with the non-linear dimension reduction (NLDR) techniques, which generally attempt to keep the nearest neighbors close together.…”
Section: Introductionmentioning
confidence: 99%