Computational chemistry and other
simulation fields are critically
dependent on computing resources, but few problems scale efficiently
to the hundreds of thousands of processors available in current supercomputersparticularly
for molecular dynamics. This has turned into a bottleneck as new hardware
generations primarily provide more processing units rather than making
individual units much faster, which simulation applications are addressing
by increasingly focusing on sampling with algorithms such as free-energy
perturbation, Markov state modeling, metadynamics, or milestoning.
All these rely on combining results from multiple simulations into
a single observation. They are potentially powerful approaches that
aim to predict experimental observables directly, but this comes at
the expense of added complexity in selecting sampling strategies and
keeping track of dozens to thousands of simulations and their dependencies.
Here, we describe how the distributed execution framework Copernicus
allows the expression of such algorithms in generic workflows: dataflow programs. Because dataflow algorithms explicitly state
dependencies of each constituent part, algorithms only need to be
described on conceptual level, after which the execution is maximally
parallel. The fully automated execution facilitates the optimization
of these algorithms with adaptive sampling, where
undersampled regions are automatically detected and targeted without
user intervention. We show how several such algorithms can be formulated
for computational chemistry problems, and how they are executed efficiently
with many loosely coupled simulations using either distributed or
parallel resources with Copernicus.
Biomolecular simulation is a core application on supercomputers, but it is exceptionally difficult to achieve the strong scaling necessary to reach biologically relevant timescales. Here, we present a new paradigm for parallel adaptive molecular dynamics and a publicly available implementation: Copernicus. This framework combines performance-leading molecular dynamics parallelized on three levels (SIMD, threads, and message-passing) with kinetic clustering, statistical model building and real-time result monitoring. Copernicus enables execution as single parallel jobs with automatic resource allocation. Even for a small protein such as villin (9,864 atoms), Copernicus exhibits near-linear strong scaling from 1 to 5,376 AMD cores. Starting from extended chains we observe structures 0.6Å from the native state within 30h, and achieve sufficient sampling to predict the native state without a priori knowledge after 80-90h. To match Copernicus' efficiency, a classical simulation would have to exceed 50 microseconds per day, currently infeasible even with custom hardware designed for simulations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.