2016
DOI: 10.1021/acs.jcim.5b00332
|View full text |Cite
|
Sign up to set email alerts
|

Feasibility of Active Machine Learning for Multiclass Compound Classification

Abstract: A common task in the hit-to-lead process is classifying sets of compounds into multiple, usually structural classes, which build the groundwork for subsequent SAR studies. Machine learning techniques can be used to automate this process by learning classification models from training compounds of each class. Gathering class information for compounds can be cost-intensive as the required data needs to be provided by human experts or experiments. This paper studies whether active machine learning can be used to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
52
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(56 citation statements)
references
References 36 publications
3
52
1
Order By: Relevance
“…Rarey and coworkers recently published an active learning framework for a multitarget problem in drug discovery, namely the ability of active learning combined with chemogenomic reasoning to build a predictive model for a target subfamily by using only a subset of available data [56]. Their investigation demonstrated efficient navigation of focused ligand-target spaces and provided a means for model training and experimental design.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Rarey and coworkers recently published an active learning framework for a multitarget problem in drug discovery, namely the ability of active learning combined with chemogenomic reasoning to build a predictive model for a target subfamily by using only a subset of available data [56]. Their investigation demonstrated efficient navigation of focused ligand-target spaces and provided a means for model training and experimental design.…”
Section: Introductionmentioning
confidence: 99%
“…That is, we investigate the potential of active learning to act as the steering wheel for family-wide computational chemogenomic model building. We evaluate its ability to predict bioactivity and how it evolves a chemogenomic model [49,[56][57][58], finding that protein/target family-wide chemogenomic active learning can build an interaction model from only a small fraction of bioactivity data points in a screening database. These models show high predictive performance on datasets many folds larger than that used for model construction.…”
Section: Introductionmentioning
confidence: 99%
“…To date, several pro-and retrospective studies report successful applications of AL strategies throughout different fields of research [27][28][29][30][31][32][33][34][35][36][37][38][39][40][41][42][43][44][45][46]. In the area of drug discovery, active learning has been shown to efficiently derive high-performance prediction models based on small subsets of input data [32,34,39,46].…”
Section: Introductionmentioning
confidence: 99%
“…In the area of drug discovery, active learning has been shown to efficiently derive high-performance prediction models based on small subsets of input data [32,34,39,46]. Furthermore, actively trained models not only reached significantly higher hit rates compared to experimental standards which frequently remain below 1 % in cases of unbiased chemical libraries [34,39,40,[47][48][49], but such models also contributed to successful identification of novel bioactive compounds [33,36,42] and cancer rescue mutants of p53 [31].…”
Section: Introductionmentioning
confidence: 99%
“…Machine learning techniques such as ANN can be used to automate this process by learning classification models from training compounds of each class [9]. It is important that the input contains all the latent feature information of the chemical compound.…”
Section: Applicationsmentioning
confidence: 99%