Biomolecular force fields have been traditionally derived based on a mixture of reference quantum chemistry data and experimental information obtained on small fragments. However, the possibility to run extensive molecular dynamics simulations on larger systems achieving ergodic sampling is paving the way to directly using such simulations along with solution experiments obtained on macromolecular systems. Recently, a number of methods have been introduced to automatize this approach. Here, we review these methods, highlight their relationship with machine learning methods, and discuss the open challenges in the field.
Atomistic molecular dynamics simulations represent an established technique for investigation of RNA structural dynamics. Despite continuous development, contemporary RNA simulations still suffer from suboptimal accuracy of empirical potentials (force fields, ffs) and sampling limitations. Development of efficient enhanced sampling techniques is important for two reasons. First, they allow us to overcome the sampling limitations, and second, they can be used to quantify ff imbalances provided they reach a sufficient convergence. Here, we study two RNA tetraloops (TLs), namely the GAGA and UUCG motifs. We perform extensive folding simulations and calculate folding free energies (ΔG fold °) with the aim to compare different enhanced sampling techniques and to test several modifications of the nonbonded terms extending the AMBER OL3 RNA ff. We demonstrate that replica-exchange solute tempering (REST2) simulations with 12−16 replicas do not show any sign of convergence even when extended to a timescale of 120 μs per replica. However, the combination of REST2 with well-tempered metadynamics (ST-MetaD) achieves good convergence on a timescale of 5−10 μs per replica, improving the sampling efficiency by at least 2 orders of magnitude. Effects of ff modifications on ΔG fold °energies were initially explored by the reweighting approach and then validated by new simulations. We tested several manually prepared variants of the gHBfix potential which improve stability of the native state of both TLs by ∼2 kcal/mol. This is sufficient to conveniently stabilize the folded GAGA TL while the UUCG TL still remains under-stabilized. Appropriate adjustment of van der Waals parameters for C−H•••O5′ base-phosphate interaction may further stabilize the native states of both TLs by ∼0.6 kcal/mol.
The
capability of
current force fields to reproduce RNA structural
dynamics is limited. Several methods have been developed to take advantage
of experimental data in order to enforce agreement with experiments.
Here, we extend an existing framework which allows arbitrarily chosen
force-field correction terms to be fitted by quantification of the
discrepancy between observables back-calculated from simulation and
corresponding experiments. We apply a robust regularization protocol
to avoid overfitting and additionally introduce and compare a number
of different regularization strategies, namely, L1, L2, Kish size,
relative Kish size, and relative entropy penalties. The training set
includes a GACC tetramer as well as more challenging systems, namely,
gcGAGAgc and gcUUCGgc RNA tetraloops. Specific intramolecular hydrogen
bonds in the AMBER RNA force field are corrected with automatically
determined parameters that we call gHBfix
opt
. A validation
involving a separate simulation of a system present in the training
set (gcUUCGgc) and new systems not seen during training (CAAU and
UUUU tetramers) displays improvements regarding the native population
of the tetraloop as well as good agreement with NMR experiments for
tetramers when using the new parameters. Then, we simulate folded
RNAs (a kink–turn and L1 stalk rRNA) including hydrogen bond
types not sufficiently present in the training set. This allows a
final modification of the parameter set which is named gHBfix21 and
is suggested to be applicable to a wider range of RNA systems.
Post-transcriptional modifications are crucial for RNA
function
and can affect its structure and dynamics. Force-field-based classical
molecular dynamics simulations are a fundamental tool to characterize
biomolecular dynamics, and their application to RNA is flourishing.
Here, we show that the set of force-field parameters for N6-methyladenosine (m6A) developed for the commonly used
AMBER force field does not reproduce duplex denaturation experiments
and, specifically, cannot be used to describe both paired and unpaired
states. Then, we use reweighting techniques to derive new parameters
matching available experimental data. The resulting force field can
be used to properly describe paired and unpaired m6A in
both syn and anti conformation,
which thus opens the way to the use of molecular simulations to investigate
the effects of N6 methylations on RNA structural dynamics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.