1998
DOI: 10.1021/ci980003j
|View full text |Cite
|
Sign up to set email alerts
|

Rational Screening Set Design and Compound Selection:  Cascaded Clustering

Abstract: The use of cascaded clustering is reported. This technique was developed to permit the application of Jarvis-Patrick clustering based on structural fingerprints to large chemical databases, while keeping the maximum cluster size and the number of singletons produced at reasonable levels. The basis for the algorithm, its implementation, and validation are described. In the first part of the paper, the approach is used to create a representative subset of compounds for biological testing from the corporate compo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
48
0

Year Published

1998
1998
2012
2012

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 42 publications
(48 citation statements)
references
References 22 publications
0
48
0
Order By: Relevance
“…This option (recursive OptiSim clustering) may be the alternative of choice when working with heterogeneously distributed populations for which the extent of structural variation is misleadingly different in different regions of the structural space; an analogous approach has recently been proposed for improving the distributions obtained from Jarvis-Patrick clustering. 13 Interaction between Subset Size M and Subsample Size K. The results described above were obtained for a relatively small population (N ) 1000) and for a fixed subset size (M ) 30). Figure 5 shows the results obtained when OptiSim was used to select from 5 to 50 compounds from the full mixed library of 6600 compounds.…”
Section: Resultsmentioning
confidence: 97%
“…This option (recursive OptiSim clustering) may be the alternative of choice when working with heterogeneously distributed populations for which the extent of structural variation is misleadingly different in different regions of the structural space; an analogous approach has recently been proposed for improving the distributions obtained from Jarvis-Patrick clustering. 13 Interaction between Subset Size M and Subsample Size K. The results described above were obtained for a relatively small population (N ) 1000) and for a fixed subset size (M ) 30). Figure 5 shows the results obtained when OptiSim was used to select from 5 to 50 compounds from the full mixed library of 6600 compounds.…”
Section: Resultsmentioning
confidence: 97%
“…One is to add the obtained singletons iteratively to the clusters in a fuzzy way [52]. Another approach is to iteratively remove singletons and re-cluster them using milder Jarvis-Patrick conditions [53]. Irrespective of these developments new methods are introduced continuously [54 ± 56].…”
Section: Clustering and Partitioning Approachesmentioning
confidence: 99%
“…[28][29][30][31][32][33][34][35] In a notable exception, decision trees were used to model activity on a small number of different targets simultaneously. 36 Alternatively, clustering algorithms have been widely used to identify distinct biologically active compound classes [37][38][39][40] and to segregate compounds into distinct structural classes that differ in their assay phenotypes. 41 This paper explores the potential of computational tools, namely clustering and an entropy-based coincidence score, to identify compound classes active in multiple assays and generate hypotheses about a compound's biological target(s) or mechanism(s) of action.…”
Section: Introductionmentioning
confidence: 99%