2024
DOI: 10.1038/s41597-023-02872-y
|View full text |Cite|
|
Sign up to set email alerts
|

PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications

Divya B. Korlepara,
Vasavi C. S.,
Rakesh Srivastava
et al.

Abstract: Computing binding affinities is of great importance in drug discovery pipeline and its prediction using advanced machine learning methods still remains a major challenge as the existing datasets and models do not consider the dynamic features of protein-ligand interactions. To this end, we have developed PLAS-20k dataset, an extension of previously developed PLAS-5k, with 97,500 independent simulations on a total of 19,500 different protein-ligand complexes. Our results show good correlation with the available… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 43 publications
0
2
0
Order By: Relevance
“…The particle mesh Ewald algorithm was applied to the long-range electrostatic interactions using cutoff distance of 10.0 Å [52]. Lastly, the simulations were conducted using PMEMD.cuda, and the trajectories were analyzed using the CPPTRAJ package [53].…”
Section: Simulationmentioning
confidence: 99%
“…The particle mesh Ewald algorithm was applied to the long-range electrostatic interactions using cutoff distance of 10.0 Å [52]. Lastly, the simulations were conducted using PMEMD.cuda, and the trajectories were analyzed using the CPPTRAJ package [53].…”
Section: Simulationmentioning
confidence: 99%
“…Several databases are available that contain raw experimental structures of protein–ligand complexes, usually extracted from the PDB (for example, PDBbind 25 , bindingDB 26 , Binding MOAD 27 , Sperrylite 28 ). Only recently a database of MD-derived traces of protein–ligand structures was reported 29 , 30 . Despite these efforts, so far no AI model has been proposed that convincingly addresses the rational DD challenge in the way that AlphaFold2 answered the protein structure prediction problem 31 , 32 .…”
Section: Mainmentioning
confidence: 99%