2010
DOI: 10.1021/ci100203e
|View full text |Cite
|
Sign up to set email alerts
|

Compound Set Enrichment: A Novel Approach to Analysis of Primary HTS Data

Abstract: The main goal of high-throughput screening (HTS) is to identify active chemical series rather than just individual active compounds. In light of this goal, a new method (called compound set enrichment) to identify active chemical series from primary screening data is proposed. The method employs the scaffold tree compound classification in conjunction with the Kolmogorov-Smirnov statistic to assess the overall activity of a compound scaffold. The application of this method to seven PubChem data sets (containin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
63
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
5

Relationship

2
8

Authors

Journals

citations
Cited by 55 publications
(64 citation statements)
references
References 31 publications
1
63
0
Order By: Relevance
“…Furthermore, our finding that noisy primary screening data can yield valuable information is consistent with other research in the field that successfully mine primary screening data while ignoring the data from confirmatory experiments 29. 9 For example, compound set enrichment (CSE) identifies statistically significant distributions of scaffolds in the primary screening data, enabling the detection of groups of active molecules that were initially missed 29. In a similar manner, local hit rate analysis (LHR) identifies clusters of molecules whose distribution is statistically significant in the primary screening data 9…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, our finding that noisy primary screening data can yield valuable information is consistent with other research in the field that successfully mine primary screening data while ignoring the data from confirmatory experiments 29. 9 For example, compound set enrichment (CSE) identifies statistically significant distributions of scaffolds in the primary screening data, enabling the detection of groups of active molecules that were initially missed 29. In a similar manner, local hit rate analysis (LHR) identifies clusters of molecules whose distribution is statistically significant in the primary screening data 9…”
Section: Discussionmentioning
confidence: 99%
“…Such actives could possibly be rescued with post-HTS data-mining techniques aimed at identifying latent hits. 23,24 Both compounds 5 and 6 were annotated as weak TP at the 8.3 µM screening concentration but found as FN at all other doses (Fig. 4b, c).…”
Section: Case Iii: Determining Optimal Screening Concentration With Qmentioning
confidence: 96%
“…Further investigations of identified scaffolds with a higher propensity for 'activity cliffs' and 'selectivity cliffs' [119], which might also serve as particular interesting starting points for library design. A strategy to identify latent privileged scaffolds for a particular target from primary future science group screening data was proposed by Varin et al [120]. The approach identifies scaffolds generated by the scaffold tree that are statistically more active than the background according to the primary screening data.…”
Section: Data Mining For Promising Library Design Starting Pointsmentioning
confidence: 99%