We
present a supercomputer-driven pipeline for in silico drug discovery
using enhanced sampling molecular dynamics (MD) and ensemble docking.
Ensemble docking makes use of MD results by docking compound databases
into representative protein binding-site conformations, thus taking
into account the dynamic properties of the binding sites. We also
describe preliminary results obtained for 24 systems involving eight
proteins of the proteome of SARS-CoV-2. The MD involves temperature
replica exchange enhanced sampling, making use of massively parallel
supercomputing to quickly sample the configurational space of protein
drug targets. Using the Summit supercomputer at the Oak Ridge National
Laboratory, more than 1 ms of enhanced sampling MD can be generated
per day. We have ensemble docked repurposing databases to 10 configurations
of each of the 24 SARS-CoV-2 systems using AutoDock Vina. Comparison
to experiment demonstrates remarkably high hit rates for the top scoring
tranches of compounds identified by our ensemble approach. We also
demonstrate that, using Autodock-GPU on Summit, it is possible to
perform exhaustive docking of one billion compounds in under 24 h.
Finally, we discuss preliminary results and planned improvements to
the pipeline, including the use of quantum mechanical (QM), machine
learning, and artificial intelligence (AI) methods to cluster MD trajectories
and rescore docking poses.
We present a supercomputer-driven pipeline for <i>in-silico</i> drug discovery using enhanced sampling molecular dynamics (MD) and ensemble docking. We also describe preliminary results obtained for 23 systems involving eight protein targets of the proteome of SARS CoV-2. THe MD performed is temperature replica-exchange enhanced sampling, making use of the massively parallel supercomputing on the SUMMIT supercomputer at Oak Ridge National Laboratory, with which more than 1ms of enhanced sampling MD can be generated per day. We have ensemble docked repurposing databases to ten configurations of each of the 23 SARS CoV-2 systems using AutoDock Vina. We also demonstrate that using Autodock-GPU on SUMMIT, it is possible to perform exhaustive docking of one billion compounds in under 24 hours. Finally, we discuss preliminary results and planned improvements to the pipeline, including the use of quantum mechanical (QM), machine learning, and AI methods to cluster MD trajectories and rescore docking poses.
Time-to-solution for structure-based screening of massive chemical databases for COVID-19 drug discovery has been decreased by an order of magnitude, and a virtual laboratory has been deployed at scale on up to 27,612 GPUs on the Summit supercomputer, allowing an average molecular docking of 19,028 compounds per second. Over one billion compounds were docked to two SARS-CoV-2 protein structures with full optimization of ligand position and 20 poses per docking, each in under 24 hours. GPU acceleration and high-throughput optimizations of the docking program produced 350× mean speedup over the CPU version (50× speedup per node). GPU acceleration of both feature calculation for machine-learning based scoring and distributed database queries reduced processing of the 2.4 TB output by orders of magnitude. The resulting 50× speedup for the full pipeline reduces an initial 43 day runtime to 21 hours per protein for providing high-scoring compounds to experimental collaborators for validation assays.
We present a supercomputer-driven pipeline for <i>in-silico</i> drug discovery using enhanced sampling molecular dynamics (MD) and ensemble docking. We also describe preliminary results obtained for 23 systems involving eight protein targets of the proteome of SARS CoV-2. THe MD performed is temperature replica-exchange enhanced sampling, making use of the massively parallel supercomputing on the SUMMIT supercomputer at Oak Ridge National Laboratory, with which more than 1ms of enhanced sampling MD can be generated per day. We have ensemble docked repurposing databases to ten configurations of each of the 23 SARS CoV-2 systems using AutoDock Vina. We also demonstrate that using Autodock-GPU on SUMMIT, it is possible to perform exhaustive docking of one billion compounds in under 24 hours. Finally, we discuss preliminary results and planned improvements to the pipeline, including the use of quantum mechanical (QM), machine learning, and AI methods to cluster MD trajectories and rescore docking poses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.