2024
DOI: 10.1021/acs.jcim.4c00220
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Active Learning Protocols for Ligand-Binding Affinity Prediction

Rohan Gorantla,
Alžbeta Kubincová,
Benjamin Suutari
et al.

Abstract: Active learning (AL) has become a powerful tool in computational drug discovery, enabling the identification of top binders from vast molecular libraries. To design a robust AL protocol, it is important to understand the influence of AL parameters, as well as the features of the data sets on the outcomes. We use four affinity data sets for different targets (TYK2, USP7, D2R, Mpro) to systematically evaluate the performance of machine learning models [Gaussian process (GP) model and Chemprop model], sample sele… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 37 publications
0
2
0
Order By: Relevance
“…We have shown in this article that AL can be an effective method to improve compound optimization for a particular target. It is important to point out how the term "active learning" is being used as we follow here the convention which has been adopted in at least part of the community (Crivelli-Decker et al, 2023;Gorantla et al, 2024;Gusev et al, 2023;Khalak et al, 2022;Knight et al, 2021;Konze et al, 2019;Mohr et al, 2022;Thompson et al, 2022). However, we have also pointed out in the Introduction that we are not principally interested in creating a surrogate model for the purpose of optimally finding new labels i.e., binding affinities.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We have shown in this article that AL can be an effective method to improve compound optimization for a particular target. It is important to point out how the term "active learning" is being used as we follow here the convention which has been adopted in at least part of the community (Crivelli-Decker et al, 2023;Gorantla et al, 2024;Gusev et al, 2023;Khalak et al, 2022;Knight et al, 2021;Konze et al, 2019;Mohr et al, 2022;Thompson et al, 2022). However, we have also pointed out in the Introduction that we are not principally interested in creating a surrogate model for the purpose of optimally finding new labels i.e., binding affinities.…”
Section: Discussionmentioning
confidence: 99%
“…In AL a, typically, computationally expensive method like docking or MD simulation is used to assign new labels e.g., a free energy of binding (also known as the binding affinity) or a docking score as proxy for the affinity. AL approaches that efficiently optimize compounds for binding affinity with RBFE (relative binding free energy) methods, (also referred to as FEP=free energy perturbation) have only appeared recently in the literature (Crivelli-Decker et al, 2023;de Oliveira et al, 2023;Gorantla et al, 2024;Gusev et al, 2023;Khalak et al, 2022;Knight et al, 2021;Konze et al, 2019;Mohr et al, 2022;Thompson et al, 2022) but have also been used to optimize an RBFE protocol itself (de Oliveira et al, 2023). Combining Generative AI with active learning (GAL) has only started very recently (Filella-Merce et al, 2023) including a proofof-concept study with peptides (Hernandez-Garcia et al, 2023) and an application with ABFE (absolute binding free energy) (Eckmann et al, 2024).…”
Section: Introductionmentioning
confidence: 99%