2020
DOI: 10.1021/acs.jctc.0c00676
|View full text |Cite
|
Sign up to set email alerts
|

An Information-Theory-Based Approach for Optimal Model Reduction of Biomolecules

Abstract: In theoretical modeling of a physical system, a crucial step consists of the identification of those degrees of freedom that enable a synthetic yet informative representation of it. While in some cases this selection can be carried out on the basis of intuition and experience, straightforward discrimination of the important features from the negligible ones is difficult for many complex systems, most notably heteropolymers and large biomolecules. We here present a thermodynamics-based theoretical framework to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
87
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 56 publications
(112 citation statements)
references
References 118 publications
0
87
0
Order By: Relevance
“…In this section, we outline the technical ingredients that lie at the basis of the results obtained in this study. Specifically, in Mapping entropy we summarize the mapping entropy protocol for optimizing CG representations presented in Giulini et al (2020) ; in Protein structures and data sets we briefly describe the two proteins analyzed in this work as well as the data sets fed to the machine learning architecture; in Data Representation and Machine Learning model we illustrate our choice for the representation of the input data, together with theoretical and computational details about DGNs; finally, in Wang–Landau Sampling we describe our implementation of the Wang–Landau sampling algorithm as applied to the reconstruction of the mapping entropy landscape of a system.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…In this section, we outline the technical ingredients that lie at the basis of the results obtained in this study. Specifically, in Mapping entropy we summarize the mapping entropy protocol for optimizing CG representations presented in Giulini et al (2020) ; in Protein structures and data sets we briefly describe the two proteins analyzed in this work as well as the data sets fed to the machine learning architecture; in Data Representation and Machine Learning model we illustrate our choice for the representation of the input data, together with theoretical and computational details about DGNs; finally, in Wang–Landau Sampling we describe our implementation of the Wang–Landau sampling algorithm as applied to the reconstruction of the mapping entropy landscape of a system.…”
Section: Methodsmentioning
confidence: 99%
“…A recently developed statistical mechanics-based strategy that aims at overcoming such limitations is the one relying on the minimization of the mapping entropy ( Giulini et al, 2020 ), which performs, in an unsupervised manner, the identification of the subset of a molecule’s atoms that retains the largest possible amount of information about its behavior. This scheme relies on the calculation of the mapping entropy S map ( Shell, 2008 ; Rudzinski and Noid, 2011 ; Shell, 2012 ; Foley et al, 2015 ), a quantity that provides a measure of the dissimilarity between the probability density of the system configurations in the original, high-resolution description and the one marginalized over the discarded atoms.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Typically one chooses CG sites according to the center-of-masses (COM) of the atom groups forming the CG units, while other choices (such as taking coordinates of specific atoms) are also possible. For instance, one can aim to minimize the information loss due to the mapping operation (Giulini et al, 2020 ), or choose beads representing collective motions (Zhang et al, 2008 ). The Hamiltonian H ( q 1 , ⋯ , q n ) defines all properties of the high-resolution system and, through the mapping functions (Equation 1), all properties of the CG system.…”
Section: Bottom-up Coarse-grainingmentioning
confidence: 99%