DynamicBind: Predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model

Lu, Wei; Zhang, Jixian; Huang, Weifeng; Zhang, Ziqiao; Jia, Xiangyu; Wang, Zhenyu; Shi, Leilei; Li, Chengtao; Wolynes, Peter G.; Zheng, Shuangjia

doi:10.21203/rs.3.rs-3225151/v1

Cited by 3 publications

(13 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, when dealing with scenarios where only the unbound protein structure (apo) is available or when protein structures are predicted using AlphaFold2, a significant drop in performance is observed for KarmaDock and other DLLD methods. Since then, a series of flexible docking methodologies , and several de novo LD methods − (Figure D) have been developed. These methods are able to generate ligand binding poses and modify protein conformations or generate the conformations for proteins and ligands from sequences at the cost of speed.…”

Section: The Contemporary Landscapementioning

confidence: 99%

“…Currently, a few DL strategies, ,,, such as KarmaDock, have successfully addressed these aspects, demonstrating enhanced docking and screening power on CASF-2016. Specifically, KarmaDock stands out not only for its docking module designed to predict binding poses but also for its integration of an MDN module tailored to PL distances, thereby facilitating a more accurate assessment of binding strength.…”

Section: Challengesmentioning

confidence: 99%

“…From a flexibility standpoint, the initial focus was on semiflexible DLLD methods, deemed sufficient for most VS scenarios and aligned with the design of most traditional LD programs. Currently, there is a growing interest in developing flexible docking methodologies and de novo methods that generate PL complex conformations directly from sequences. − This interest is driven by the scenarios where only unbound protein structures (apo structures), available through databases like PDB or predicted by tools like AlphaFold2, RosettaFold, and ESMFold, are accessible. These unbound structures may significantly differ from their bound counterparts (holo structures), and semiflexible DLLD methods, often trained on data sets comprising holo structures, have limited generalizability to apo structures.…”

Section: Challengesmentioning

confidence: 99%

“…The majority of existing DLLD methodologies ,− ,, including KarmaDock suffer from inadequate generalization capabilities, resulting in excellent performance on samples resembling the training set but decreased effectiveness or the generation of physically implausible conformations on out-of-distribution samples. Therefore, enhancing the generalization capacity of the DLLD approaches is crucial.…”

Section: Prospectsmentioning

confidence: 99%

See 3 more Smart Citations

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening

Zhang,

Shen,

Zhang

et al. 2024

Acc. Chem. Res.

View full text Add to dashboard Cite

Metrics & MoreArticle Recommendations CONSPECTUS: Molecular docking, also termed ligand docking (LD), is a pivotal element of structure-based virtual screening (SBVS) used to predict the binding conformations and affinities of protein−ligand complexes. Traditional LD methodologies rely on a search and scoring framework, utilizing heuristic algorithms to explore binding conformations and scoring functions to evaluate binding strengths. However, to meet the efficiency demands of SBVS, these algorithms and functions are often simplified, prioritizing speed over accuracy.The emergence of deep learning (DL) has exerted a profound impact on diverse fields, ranging from natural language processing to computer vision and drug discovery. DeepMind's AlphaFold2 has impressively exhibited its ability to accurately predict protein structures solely from amino acid sequences, highlighting the remarkable potential of DL in conformation prediction. This groundbreaking advancement circumvents the traditional search-scoring frameworks in LD, enhancing both accuracy and processing speed and thereby catalyzing a broader adoption of DL algorithms in binding pose prediction. Nevertheless, a consensus on certain aspects remains elusive.In this Account, we delineate the current status of employing DL to augment LD within the VS paradigm, highlighting our contributions to this domain. Furthermore, we discuss the challenges and future prospects, drawing insights from our scholarly investigations. Initially, we present an overview of VS and LD, followed by an introduction to DL paradigms, which deviate significantly from traditional search-scoring frameworks. Subsequently, we delve into the challenges associated with the development of DL-based LD (DLLD), encompassing evaluation metrics, application scenarios, and physical plausibility of the predicted conformations. In the evaluation of LD algorithms, it is essential to recognize the multifaceted nature of the metrics. While the accuracy of binding pose prediction, often measured by the success rate, is a pivotal aspect, the scoring/screening power and computational speed of these algorithms are equally important given the pivotal role of LD tools in VS. Regarding application scenarios, early methods focused on blind docking, where the binding site is unknown. However, recent studies suggest a shift toward identifying binding sites rather than solely predicting binding poses within these models. In contrast, LD with a known pocket in VS has been shown to be more practical. Physical plausibility poses another significant challenge. Although DLLD models often achieve higher success rates compared to traditional methods, they may generate poses with implausible local structures, such as incorrect bond angles or lengths, which are disadvantageous for postprocessing tasks like visualization. Finally, we discuss the future perspectives for DLLD, emphasizing the need to improve generalization ability, strike a balance between speed and accuracy, account for protein conformation flexibility, and ...

show abstract

Section: The Contemporary Landscapementioning

confidence: 99%

Section: Challengesmentioning

confidence: 99%

Section: Challengesmentioning

confidence: 99%

Section: Prospectsmentioning

confidence: 99%

See 2 more Smart Citations

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening

Zhang,

Shen,

Zhang

et al. 2024

Acc. Chem. Res.

View full text Add to dashboard Cite

show abstract

“…Unfortunately, there is no common universally adopted definition of pocket, making empirical direct comparisons between these approaches very hard. For instance, different methods 88 use boxes or spheres of various sizes around the true ligand pose to define the pocket, while others 85 consider the pocket as the set of residues of the proteins interacting with the ligand. 2.…”

Section: Modeling Choicesmentioning

confidence: 99%

Diffusion models in protein structure and docking

Yim,

Stärk,

Corso

et al. 2024

WIREs Comput Mol Sci

View full text Add to dashboard Cite

Generative AI is rapidly transforming the frontier of research in computational structural biology. Indeed, recent successes have substantially advanced protein design and drug discovery. One of the key methodologies underlying these advances is diffusion models (DM). Diffusion models originated in computer vision, rapidly taking over image generation and offering superior quality and performance. These models were subsequently extended and modified for uses in other areas including computational structural biology. DMs are well equipped to model high dimensional, geometric data while exploiting key strengths of deep learning. In structural biology, for example, they have achieved state‐of‐the‐art results on protein 3D structure generation and small molecule docking. This review covers the basics of diffusion models, associated modeling choices regarding molecular representations, generation capabilities, prevailing heuristics, as well as key limitations and forthcoming refinements. We also provide best practices around evaluation procedures to help establish rigorous benchmarking and evaluation. The review is intended to provide a fresh view into the state‐of‐the‐art as well as highlight its potentials and current challenges of recent generative techniques in computational structural biology.This article is categorized under: Data Science > Artificial Intelligence/Machine Learning Structure and Mechanism > Molecular Structures Software > Molecular Modeling

show abstract

Deep learning of protein energy landscape and conformational dynamics from experimental structures in PDB

Tang,

Yu,

Bai

et al. 2024

Preprint

View full text Add to dashboard Cite

Protein structure prediction has reached revolutionary levels of accuracy on single structures, implying biophysical energy function can be learned from known protein structures. However apart from single static structure, conformational distributions and dynamics often control protein biological functions. In this work, we tested a hypothesis that protein energy landscape and conformational dynamics can be learned from experimental structures in PDB and coevolution data. Towards this goal, we develop DeepConformer, a diffusion generative model for sampling protein conformation distributions from a given amino acid sequence. Despite the lack of molecular dynamics (MD) simulation data in training process, DeepConformer captured conformational flexibility and dynamics (RMSF and covariance matrix correlation) similar to MD simulation and reproduced experimentally observed conformational variations. Our study demonstrated that DeepConformer learned energy landscape can be used to efficiently explore protein conformational distribution and dynamics.

show abstract

DynamicBind: Predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model

Cited by 3 publications

References 44 publications

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening

Diffusion models in protein structure and docking

Deep learning of protein energy landscape and conformational dynamics from experimental structures in PDB

Contact Info

Product

Resources

About