2014
DOI: 10.1007/s10822-014-9827-y
|View full text |Cite
|
Sign up to set email alerts
|

A comparative study of family-specific protein–ligand complex affinity prediction based on random forest approach

Abstract: The assessment of binding affinity between ligands and the target proteins plays an essential role in drug discovery and design process. As an alternative to widely used scoring approaches, machine learning methods have also been proposed for fast prediction of the binding affinity with promising results, but most of them were developed as all-purpose models despite of the specific functions of different protein families, since proteins from different function families always have different structures and phys… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
42
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 46 publications
(42 citation statements)
references
References 42 publications
0
42
0
Order By: Relevance
“…Compared to DTs, it is impossible that RF over-fits the data, and the RF has been used for bioactivity data classification [81], toxicity modeling [82], and drug target prediction [83], etc. Wang et al [84] used the RF approach to model the binding affinity of protein-ligand on 170 HIV-1 proteases complexes, 110 trypsin complexes, and 126 carbonic anhydrase complexes, which demonstrated that individual representation and model construction for each protein family is a more reasonable way in predicting the affinity of one particular protein family.…”
Section: Classical Qsar Methodsmentioning
confidence: 99%
“…Compared to DTs, it is impossible that RF over-fits the data, and the RF has been used for bioactivity data classification [81], toxicity modeling [82], and drug target prediction [83], etc. Wang et al [84] used the RF approach to model the binding affinity of protein-ligand on 170 HIV-1 proteases complexes, 110 trypsin complexes, and 126 carbonic anhydrase complexes, which demonstrated that individual representation and model construction for each protein family is a more reasonable way in predicting the affinity of one particular protein family.…”
Section: Classical Qsar Methodsmentioning
confidence: 99%
“…As previously argued in the context of diverse test sets, Wang et al’s SF did not perform better than RF‐Score in every target class despite using a far more precise description of the complex. Actually, the new SF only outperformed RF‐Score on HIV protease and carbonic anhydrase, half of the evaluated target classes. The situation could be different when building family‐specific SFs, as complexes of a protein family tend to be more similar and thus more likely to be well described by a single precise characterization (e.g., adding features that are rare across all targets but common within the family such as the presence of a particular metal ion).…”
Section: Family‐specific Machine‐learning Sfsmentioning
confidence: 99%
“…Xue et al 24 used 94 drugs against human serum albumin and have used SVM models for prediction the binding affinity. Deng et al 27 has used 105 diverse protein-ligand complexes and Wang et al 29 used pdbbind version 2012. Wang et al…”
Section: Comparative Studymentioning
confidence: 99%