2020
DOI: 10.1038/s41598-020-61860-z
|View full text |Cite
|
Sign up to set email alerts
|

Improving detection of protein-ligand binding sites with 3D segmentation

Abstract: In recent years machine learning (ML) took bioand cheminformatics fields by storm, providing new solutions for a vast repertoire of problems related to protein sequence, structure, and interactions analysis. ML techniques, deep neural networks especially, were proven more effective than classical models for tasks like predicting binding affinity for molecular complex.In this work we investigated the earlier stage of drug discovery process -finding druggable pockets on protein surface, that can be later used to… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
118
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 94 publications
(119 citation statements)
references
References 22 publications
1
118
0
Order By: Relevance
“…Though it was one of the best sequence-based performers, the authors note that structure-based methods still outperform sequence-based ones, even with the help of machine learning 43 . Some of the newest developments in this area include DeepSite 44 , DeepCSeqSite 45 , Kalasanty 46 , and UTProt Galaxy pipeline 47 . These methods all utilize 3D convolutional neural networks with various information involving sequence, distances, and other physicochemical parameters to characterize putative binding pockets.…”
Section: Lbs-prediction Methods Lbs-prediction Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Though it was one of the best sequence-based performers, the authors note that structure-based methods still outperform sequence-based ones, even with the help of machine learning 43 . Some of the newest developments in this area include DeepSite 44 , DeepCSeqSite 45 , Kalasanty 46 , and UTProt Galaxy pipeline 47 . These methods all utilize 3D convolutional neural networks with various information involving sequence, distances, and other physicochemical parameters to characterize putative binding pockets.…”
Section: Lbs-prediction Methods Lbs-prediction Methodsmentioning
confidence: 99%
“…Kalasanty is a machine-learning method which utilizes a 3D fully convolutional neural network which characterizes protein binding pockets using physicochemical characteristics of protein atoms distributed across a 70 Å cubic grid 46 . The feature information used in their calculation describes: atom type, hybridization, number of bonds with other heavy atoms, number of bonds with other hetero atoms, encoding properties (hydrophobic, aromatic, acceptor, donor, and ring) of groups, and whether an atom belongs to a ligand or protein.…”
Section: Assessment Metrics Statistical Tests (Correlation R 2 T-tmentioning
confidence: 99%
“…DeepSite 46 follows a similar approach to P2Rank as they use a CNN to score all points on the protein surface and clusters all points with high scores to generate candidate binding pockets. Kalasanty, 47 on the other hand, passes the entire protein structure through a CNN-based segmentation model inspired by the U-Net 48 to generate the predicted binding sites in one step. It assigns a probability to each voxel of being part of a pocket.…”
Section: Introductionmentioning
confidence: 99%
“…Nonetheless, a variety of molecular descriptors have been identified for representing a molecule in a 3D data structure (vide infra) and successfully used for a diverse set of problems ranging from protein-ligand binding affinity prediction [2][3][4][5][6][7][8][9] and receptor binding site detection and classification [10][11][12][13] to the prediction of material properties 14,15 and NMR chemical shifts. 16 A major complication associated with 3D input representations is its high data sparsity aggravated due to its 3D grid cell structure; therefore, in this article, we advocate the use of spatially dense descriptors, especially the ones based on the electron distribution in the molecule, thus providing an alternative to mitigate the data structure sparsity.…”
Section: Introductionmentioning
confidence: 99%