2023
DOI: 10.1021/acs.jcim.2c01253
|View full text |Cite
|
Sign up to set email alerts
|

ZINC-22─A Free Multi-Billion-Scale Database of Tangible Compounds for Ligand Discovery

Abstract: Purchasable chemical space has grown rapidly into the tens of billions of molecules, providing unprecedented opportunities for ligand discovery but straining the tools that might exploit these molecules at scale. We have therefore developed ZINC-22, a database of commercially accessible small molecules derived from multi-billion-scale make-on-demand libraries. The new database and tools enable analog searching in this vast new space via a facile GUI, CartBlanche, drawing on similarity methods that scale sublin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
49
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 85 publications
(50 citation statements)
references
References 33 publications
0
49
0
1
Order By: Relevance
“…With the determination of the cryo-EM structure of the SARS-CoV-2 Nsp10-Nsp14 complex (PDB ID 7N0B), and the development of a larger “tangible” ZINC22 library of 1.1 billion molecules, we launched a second docking screen. The 680 million molecules previously docked were a subset of this 1.1 billion molecule set; the key differences in docking came from the change in orientation of the side chain of Asp352 in the cryo-EM structure of SARS-CoV-2 Nsp14 (PDB ID 7N0B).…”
Section: Resultsmentioning
confidence: 99%
“…With the determination of the cryo-EM structure of the SARS-CoV-2 Nsp10-Nsp14 complex (PDB ID 7N0B), and the development of a larger “tangible” ZINC22 library of 1.1 billion molecules, we launched a second docking screen. The 680 million molecules previously docked were a subset of this 1.1 billion molecule set; the key differences in docking came from the change in orientation of the side chain of Asp352 in the cryo-EM structure of SARS-CoV-2 Nsp14 (PDB ID 7N0B).…”
Section: Resultsmentioning
confidence: 99%
“…The similarity baseline simply prioritizes the closest analogues of known actives and is related to standard approaches known as "hit expansion" or "analogue by catalogue." 4 To further explore why the ensemble methods did not improve the performance over the best individual models (Figures S1 and S2), we examined the overlap in actives retrieved between the ensembles and top models (Tables S2− S6). Although the ensembles found some actives not prioritized by the best RF-C model, the RF-C found many more actives, leading to better performance.…”
Section: Resultsmentioning
confidence: 99%
“…Commercial libraries, which are the primary source of compounds in academic screening efforts, can be grouped into two categories: “in-stock” and “synthesize-on-demand.” As of 2021, in-stock libraries comprise over 12 million molecules previously synthesized and physically stored by vendors . Synthesize-on-demand libraries are databases containing virtual molecules that a vendor considers to be readily accessible given stocks of available building blocks and established synthetic routes. , Such libraries now measure in the billions: the ZINC-22 database aggregates 37 billion compounds from the Enamine REAL, WuXi GalaXi, and Mcule Ultimate catalogues . Synthesize-on-demand libraries have superior chemical scaffold diversity and coverage of chemical shape characteristics, unlocking new possibilities to manipulate biological targets.…”
Section: Introductionmentioning
confidence: 99%
“…We wondered if there might be many molecules in a general library that might be suitable to form symmetrical stacks in a fibril. To test how symmetry docking would work in a large library screen, we docked 22 million molecules from the ZINC22 database 36 against the GTP-1-bound form of AD PHF tau. We selected molecules that were similar in gross properties to GTP-1: neutral, with heavy atom count between 21 and 23, and clogP values between 3.30 and 3.50.…”
Section: Resultsmentioning
confidence: 99%