Successive Statistical and Structure-Based Modeling to Identify Chemically Novel Kinase Inhibitors

Burggraaff, Lindsey; Lenselink, Eelke B.; Jespers, Willem; Engelen, Jesper E. van; Bongers, Brandon J.; González, Marina Gorostiola; Liu, Ronghui; Hoos, Holger H.; Vlijmen, Herman van; IJzerman, Adriaan P.

doi:10.1021/acs.jcim.9b01204

Cited by 9 publications

(6 citation statements)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Burggraaff et al have successfully carried out a statistical and structure-based virtual screening for the discovery of several RET kinase inhibitors. 128 …”

Section: Fundamentals Of Computer-aided Drug Design (Cadd)mentioning

confidence: 99%

Fundamental considerations in drug design

Mahapatra,

Karuppasamy

2022

Computer Aided Drug Design (CADD): From Ligand-Based Methods to Structure-Based Approaches

View full text Add to dashboard Cite

“…Burggraaff et al have successfully carried out a statistical and structure-based virtual screening for the discovery of several RET kinase inhibitors. 128 …”

Section: Fundamentals Of Computer-aided Drug Design (Cadd)mentioning

confidence: 99%

Fundamental considerations in drug design

Mahapatra,

Karuppasamy

2022

Computer Aided Drug Design (CADD): From Ligand-Based Methods to Structure-Based Approaches

View full text Add to dashboard Cite

“…Generally, these computational methods can be classified into two major categories: structure- and ligand-based kinase inhibition and/or profiling prediction approaches (called virtual assay). Molecular docking, commonly used in structure-based prediction methods for kinase inhibition, has good generalizability, but its accuracy depends on the crystal structure of the kinase and the accuracy of the scoring function [ 13 , 14 ]. Ligand-based methods include pharmacophore modelling, and quantitative structure–activity relationship (QSAR) [ 15 – 21 ].…”

Section: Introductionmentioning

confidence: 99%

Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors

Wu,

Chen,

et al. 2024

J Cheminform

View full text Add to dashboard Cite

Conventional machine learning (ML) and deep learning (DL) play a key role in the selectivity prediction of kinase inhibitors. A number of models based on available datasets can be used to predict the kinase profile of compounds, but there is still controversy about the advantages and disadvantages of ML and DL for such tasks. In this study, we constructed a comprehensive benchmark dataset of kinase inhibitors, involving in 141,086 unique compounds and 216,823 well-defined bioassay data points for 354 kinases. We then systematically compared the performance of 12 ML and DL methods on the kinase profiling prediction task. Extensive experimental results reveal that (1) Descriptor-based ML models generally slightly outperform fingerprint-based ML models in terms of predictive performance. RF as an ensemble learning approach displays the overall best predictive performance. (2) Single-task graph-based DL models are generally inferior to conventional descriptor- and fingerprint-based ML models, however, the corresponding multi-task models generally improves the average accuracy of kinase profile prediction. For example, the multi-task FP-GNN model outperforms the conventional descriptor- and fingerprint-based ML models with an average AUC of 0.807. (3) Fusion models based on voting and stacking methods can further improve the performance of the kinase profiling prediction task, specifically, RF::AtomPairs + FP2 + RDKitDes fusion model performs best with the highest average AUC value of 0.825 on the test sets. These findings provide useful information for guiding choices of the ML and DL methods for the kinase profiling prediction tasks. Finally, an online platform called KIPP (https://kipp.idruglab.cn) and python software are developed based on the best models to support the kinase profiling prediction, as well as various kinase inhibitor identification tasks including virtual screening, compound repositioning and target fishing.

show abstract

“…In the traditional sense, QSPR modelling focuses mainly on describing the relationship between the compound structure and a property of interest, but proteochemometric modelling (PCM) has emerged as an extension that also introduces the protein target information into the equation [13,14]. A PCM approach can extrapolate similarities and differences across (super)families and is therefore promising in poly-pharmacology and off-target prediction [15], as well as a strategy for data augmentation and relevant binding residue identification [16,17]. Although in the traditional sense, the architecture is identical to that of a single-task model, it includes bio-activity endpoints for multiple proteins, by featurizing each compound-protein combination separately [18].…”

Section: Introductionmentioning

confidence: 99%

QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool

van den Maagdenberg,

Šícho,

Alencar Araripe

et al. 2024

Preprint

View full text Add to dashboard Cite

Building reliable and robust quantitative structure-property relationship (QSPR) models is a challenging task. First, the experimental data needs to be obtained, analyzed and curated. Second, the number of available methods is continuously growing and evaluating different algorithms and methodologies can be arduous. Finally, the last hurdle that researchers face is to ensure the reproducibility of their models and facilitate their transferability into practice. In this work, we introduce QSPRpred, a toolkit for analysis of bioactivity data sets and QSPR modelling, which attempts to address the aforementioned challenges. QSPRpred's modular Python API enables users to intuitively describe different parts of a modelling workflow using a plethora of pre-implemented components, but also integrate customized implementations in a "plug-and-play" manner. QSPRpred data sets and models are directly serializable, which means they can be readily reproduced and put into operation after training as the models are saved with all required data pre-processing steps to make predictions on new compounds directly from SMILES strings. The general-purpose character of QSPRpred is also demonstrated by inclusion of support for multi-task and proteochemometric modelling. The package is extensively documented and comes with a large collection of tutorials to help new users. In this paper, we describe all of QSPRpred's functionalities and also conduct a small benchmarking case study to illustrate how different components can be leveraged to compare a diverse set of models. QSPRpred is fully open-source and available at https://github.com/CDDLeiden/QSPRpred. Scientific Contribution QSPRpred aims to provide a complex, but comprehensive Python API to conduct all tasks encountered in QSPR modelling from data preparation and analysis to model creation and model deployment. In contrast to similar packages, QSPRpred offers a wider and more exhaustive range of capabilities and integrations with many popular packages that also go beyond QSPR modelling. A significant contribution of QSPRpred is also in its automated and highly standardized serialization scheme, which significantly improves reproducibility and transferability of models.

show abstract

Successive Statistical and Structure-Based Modeling to Identify Chemically Novel Kinase Inhibitors

Cited by 9 publications

References 45 publications

Fundamental considerations in drug design

Fundamental considerations in drug design

Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors

QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool

Contact Info

Product

Resources

About