2023
DOI: 10.1021/acs.jcim.3c00588
|View full text |Cite
|
Sign up to set email alerts
|

Building Block-Based Binding Predictions for DNA-Encoded Libraries

Abstract: DNA-encoded libraries (DELs) provide the means to make and screen millions of diverse compounds against a target of interest in a single experiment. However, despite producing large volumes of binding data at a relatively low cost, the DEL selection process is susceptible to noise, necessitating computational follow-up to increase signal-to-noise ratios. In this work, we present a set of informatics tools to employ data from prior DEL screen(s) to gain information about which building blocks are most likely to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 44 publications
(115 reference statements)
0
3
0
Order By: Relevance
“…Further work is necessary to develop guidelines for three-cycle libraries or modes to identify covalent inhibitors, protein–protein interaction disruptors, or molecular degraders , to accommodate their expanded PCPs. Additional studies comparing different dimensionality reduction techniques for DEL analysis, such as TMAP, GTM, or UMAP, are also likely to be valuable. Finally, while model library size was constrained to 192 × 192 (∼37k numerical diversity), which is at least an order of magnitude smaller than some DELs, the field is migrating toward smaller libraries with more lead-like to drug-like properties. , These trends have accompanied experimental design innovations, enabling the detection of weaker binding events from lower cycle libraries through photoactivatable handles. …”
Section: Resultsmentioning
confidence: 99%
“…Further work is necessary to develop guidelines for three-cycle libraries or modes to identify covalent inhibitors, protein–protein interaction disruptors, or molecular degraders , to accommodate their expanded PCPs. Additional studies comparing different dimensionality reduction techniques for DEL analysis, such as TMAP, GTM, or UMAP, are also likely to be valuable. Finally, while model library size was constrained to 192 × 192 (∼37k numerical diversity), which is at least an order of magnitude smaller than some DELs, the field is migrating toward smaller libraries with more lead-like to drug-like properties. , These trends have accompanied experimental design innovations, enabling the detection of weaker binding events from lower cycle libraries through photoactivatable handles. …”
Section: Resultsmentioning
confidence: 99%
“…To that end, other methods have tackled this problem by utilizing the molecular structure via molecular fingerprints and graph neural networks. McCloskey et al 12 and Zhang et al 13 bin the count data and construct a classification problem based on their discretizations of the data. In particular, Zhang et al 13 propose exploiting the compositional structure of DELs for the extraction of enrichment signals, but unlike our work, no explicit generative models of this factorized representation or the improved likelihoods to deal with DEL data are built.…”
Section: ■ Introductionmentioning
confidence: 99%
“…McCloskey et al and Zhang et al bin the count data and construct a classification problem based on their discretizations of the data. In particular, Zhang et al propose exploiting the compositional structure of DELs for the extraction of enrichment signals, but unlike our work, no explicit generative models of this factorized representation or the improved likelihoods to deal with DEL data are built. Other approaches formulate the problem as a latent-variable prediction task, maximizing the probability of observing the count data under some prescribed probability distribution such as the Poisson or negative binomial distribution. Shmilovich et al extend the representation capabilities of models on DEL data by incorporating 3D docked poses to enhance the performance of models without requiring additional supervised validation data.…”
Section: Introductionmentioning
confidence: 99%