2023
DOI: 10.1021/acs.jctc.3c00840
|View full text |Cite
|
Sign up to set email alerts
|

DiAMoNDBack: Diffusion-Denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces

Michael S. Jones,
Kirill Shmilovich,
Andrew L. Ferguson

Abstract: Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long time scales, such as aggregation and folding. The reduced resolution realizes computational accelerations, but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of restoring all-atom resolution to coarse-grained molecular models. In this work, we report DiAMoNDBack (Diffusion-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 92 publications
1
6
0
Order By: Relevance
“…By comparing the quality of the generated data with original data, DDPMs were accurate in most cases up to using only 10 out of 31 windows and observing sustained accuracy while cutting the data down to one-third of what is traditionally required points to the strong ability of DDPMs to learn the underlying data distribution. This observation concurs with the strong performance of DDPMs in various tasks in various fields, including replica-exchange simulations, docking, protein structure prediction, ,, and other related tasks. , − Furthermore, in addition to the comparison with the ground truth established by all 31 window US simulations, we compared the performance of the DDPM-enhanced model with only 10 US data. As expected, the PMF curve generated from only 10 US data was rough and produced large error values for both Δ G -partitioning and Δ G -crossing (Figure S6).…”
Section: Discussionsupporting
confidence: 77%
“…By comparing the quality of the generated data with original data, DDPMs were accurate in most cases up to using only 10 out of 31 windows and observing sustained accuracy while cutting the data down to one-third of what is traditionally required points to the strong ability of DDPMs to learn the underlying data distribution. This observation concurs with the strong performance of DDPMs in various tasks in various fields, including replica-exchange simulations, docking, protein structure prediction, ,, and other related tasks. , − Furthermore, in addition to the comparison with the ground truth established by all 31 window US simulations, we compared the performance of the DDPM-enhanced model with only 10 US data. As expected, the PMF curve generated from only 10 US data was rough and produced large error values for both Δ G -partitioning and Δ G -crossing (Figure S6).…”
Section: Discussionsupporting
confidence: 77%
“…For example, Stieffenhofer et al trained generative adversarial networks to reconstruct protein structural ensembles. Similar tasks have been pursued with different architectures. , Our previous work seeks not only to add stochasticity to this task but also to ensure that the sampled distribution can be reweighted to ensure Boltzmann statistics; we carry this out by training normalizing flows that conditionally sample rotamers given a backbone conformation . However, unlike refs and , the models developed in ref trained for individual proteins and are not general purpose.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the tractable structure of rotamer distributions, several groups have pursued efforts to systematically enumerate the available states. , Furthermore, these libraries have subsequently been deployed to capture the sequence dependence of side chains using relaxation algorithms. ,, Recently, this topic has been revisited in the context of more expressive probabilistic models built with deep neural networks. ,,, However, in large part, these efforts have not sought to add physically meaningful thermal stochastic fluctuations.…”
Section: Introductionmentioning
confidence: 99%
“…In practice, a full atom description of large macromolecular nanomachines is currently difficult to achieve and one must resort to coarse-grained models ( 5–7 ). Note that several methods have been developed to go from a coarse-grained representation of biological macromolecules back to their full atomic models ( 8–10 ).…”
Section: Introductionmentioning
confidence: 99%