Fast Fourier transform (FFT)-based protein ligand docking together with parallel simulated annealing for both rigid and flexible receptor docking are implemented on graphical processing unit (GPU) accelerated platforms to significantly enhance the throughput of the CDOCKER and flexible CDOCKER the docking algorithms in the CHARMM program for biomolecule modeling. The FFT-based approach for docking, first applied in protein–protein docking to efficiently search for the binding position and orientation of proteins, is adapted here to search ligand translational and rotational spaces given a ligand conformation in protein–ligand docking. Running on GPUs, our implementation of FFT docking in CDOCKER achieves a 15 000 fold speedup in the ligand translational and rotational space search in protein–ligand docking problems. With this significant speedup it becomes practical to exhaustively search ligand translational and rotational space when docking a rigid ligand into a protein receptor. We demonstrate in this paper that this provides an efficient way to calculate an upper bound for docking accuracy in the assessment of scoring functions for protein–ligand docking, which can be useful for improving scoring functions. The parallel molecular dynamics (MD) simulated annealing, also running on GPUs, aims to accelerate the search algorithm in CDOCKER by running MD simulated annealing in parallel on GPUs. When utilized as part of the general CDOCKER docking protocol, acceleration in excess of 20 times is achieved. With this acceleration, we demonstrate that the performance of CDOCKER for redocking is significantly improved compared with three other popular protein–ligand docking programs on two widely used protein ligand complex data sets: the Astex diverse set and the SB2012 test set. The flexible CDOCKER is similarly improved by the parallel MD simulated annealing on GPUs. Based on the results presented here, we suggest that the accelerated CDOCKER platform provides a highly competitive docking engine for both rigid-receptor and flexible-receptor docking studies and will further facilitate continued improvement in the physics-based scoring function employed in CDOCKER docking studies.
The binding of small-molecule ligands to protein or nucleic acid targets is important to numerous biological processes. Accurate prediction of the binding modes between a ligand and a macromolecule is of fundamental importance in structure-based structure–function exploration. When multiple ligands with different sizes are docked to a target receptor, it is reasonable to assume that the residues in the binding pocket may adopt alternative conformations upon interacting with the different ligands. In addition, it has been suggested that the entropic contribution to binding can be important. However, only a few attempts to include the side chain conformational entropy upon binding within the application of flexible receptor docking methodology exist. Here, we propose a new physics-based scoring function that includes both enthalpic and entropic contributions upon binding by considering the conformational variability of the flexible side chains within the ensemble of docked poses. We also describe a novel hybrid searching algorithm that combines both molecular dynamics (MD)-based simulated annealing and genetic algorithm crossovers to address the enhanced sampling of the increased search space. We demonstrate improved accuracy in flexible cross-docking experiments compared with rigid cross-docking. We test our developments by considering five protein targets, thrombin, dihydrofolate reductase(DHFR), T4 L99A, T4 L99A/M102Q, and PDE10A, which belong to different enzyme classes with different binding pocket environments, as a representative set of diverse ligands and receptors. Each target contains dozens of different ligands bound to the same binding pocket. We also demonstrate that this flexible docking algorithm may be applicable to RNA docking with a representative riboswitch example. Our findings show significant improvements in top ranking accuracy across this set, with the largest improvement relative to rigid, 23.64%, occurring for ligands binding to DHFR. We then evaluate the ability to identify lead compounds among a large chemical space for the proposed flexible receptor docking algorithm using a subset of the DUD-E containing receptor targets MCR, GCR, and ANDR. We demonstrate that our new algorithms show improved performance in modeling flexible binding site residues compared to DOCK. Finally, we select the T4 L99A and T4 L99A/M102Q decoy sets, containing dozens of binders and experimentally validated nonbinders, to test our approach in distinguishing binders from nonbinders. We illustrate that our new algorithms for searching and scoring have superior performance to rigid receptor CDOCKER as well as AutoDock Vina. Finally, we suggest that flexible CDOCKER is sufficiently fast to be utilized in high-throughput docking screens in the context of hierarchical approaches.
The COVID-19 pandemic has highlighted the need for new antiviral approaches because many of the currently approved drugs have proven ineffective against mitigating SARS-CoV-2 infections. The host transmembrane serine protease TMPRSS2 is a promising antiviral target because it plays a role in priming the spike protein before viral entry occurs for the most virulent variants. Further, TMPRSS2 has no established physiological role, thereby increasing its attractiveness as a target for antiviral agents. Here, we utilize virtual screening to curate large libraries into a focused collection of potential inhibitors. Optimization of a recombinant expression and purification protocol for the TMPRSS2 peptidase domain facilitates subsequent biochemical screening and characterization of selected compounds from the curated collection in a kinetic assay. In doing so, we identify new noncovalent TMPRSS2 inhibitors that block SARS-CoV-2 infectivity in a cellular model. One such inhibitor, debrisoquine, has high ligand efficiency, and an initial structure− activity relationship study demonstrates that debrisoquine is a tractable hit compound for TMPRSS2.
CHARMM is rich in methodology and functionality as one of the first programs addressing problems of molecular dynamics and modeling of biological macromolecules and their partners, e.g., small molecule ligands. When combined with the highly developed CHARMM parameters for proteins, nucleic acids, small molecules, lipids, sugars, and other biologically relevant building blocks, and the versatile CHARMM scripting language, CHARMM has been a trendsetting platform for modeling studies of biological macromolecules. To further enhance the utility of accessing and using CHARMM functionality in increasingly complex workflows associated with modeling biological systems, we introduce pyCHARMM, Python bindings, functions, and modules to complement and extend the extensive set of modeling tools and methods already available in CHARMM. These include access to CHARMM function-generated variables associated with the system (psf), coordinates, velocities and forces, atom selection variables, and force field related parameters. The ability to augment CHARMM forces and energies with energy terms or methods derived from machine learning or other sources, written in Python, CUDA, or OpenCL and expressed as Python callable routines is introduced together with analogous functions callable during dynamics calculations. Integration of Python-based graphical engines for visualization of simulation models and results is also accessible. Loosely coupled parallelism is available for workflows such as free energy calculations, using MBAR/TI approaches or high-throughput multisite λ-dynamics (MSλD) free energy methods, string path optimization calculations, replica exchange, and molecular docking with a new Python-based CDOCKER module. CHARMM accelerated platform kernels through the CHARMM/OpenMM API, CHARMM/DOMDEC, and CHARMM/BLaDE API are also readily integrated into this Python framework. We anticipate that pyCHARMM will be a robust platform for the development of comprehensive and complex workflows utilizing Python and its extensive functionality as well as an optimal platform for users to learn molecular modeling methods and practices within a Python-friendly environment such as Jupyter Notebooks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.