Although significant advances in experimental high throughput screening (HTS) have been made for drug lead identification, in silico virtual screening (VS) is indispensable owing to its unique advantage over experimental HTS, target-focused, cheap, and efficient, albeit its disadvantage of producing false positive hits. For both experimental HTS and VS, the quality of screening libraries is crucial and determines the outcome of those studies. In this paper, we first reviewed the recent progress on screening library construction. We realized the urgent need for compiling high-quality screening libraries in drug discovery. Then we compiled a set of screening libraries from about 20 million druglike ZINC molecules by running fingerprint-based similarity searches against known drug molecules. Lastly, the screening libraries were objectively evaluated using 5847 external actives covering more than 2000 drug targets. The result of the assessment is very encouraging. For example, with the Tanimoto coefficient being set to 0.75, 36% of external actives were retrieved and the enrichment factor was 13. Additionally, drug target family specific screening libraries were also constructed and evaluated. The druglike screening libraries are available for download from https://mulan.pharmacy.pitt.edu.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.