1998
DOI: 10.1021/ci980109e
|View full text |Cite
|
Sign up to set email alerts
|

Molecular Diversity and Representativity in Chemical Databases

Abstract: It is now common practice in the pharmaceutical industry to use molecular diversity selection methods. With the advent of high throughput screening and combinatorial chemistry, compounds must be rationally selected from databases of hundreds of thousands of compounds to be tested for several biological activities. We explore the differences between diversity and representativity. Validation runs were made for different diversity selection methods (such as the MaxMin function), several representativity techniqu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
79
0
2

Year Published

1998
1998
2009
2009

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 96 publications
(81 citation statements)
references
References 23 publications
0
79
0
2
Order By: Relevance
“…For instance, descriptors for the QSAR of relatively small sets of related agents are not applicable for the analysis of chemical diversity. 22 The use of descriptors unrelated to a particular type of properties or biological activity likely generates noise in a statistical learning system, which may affect the prediction accuracy of that system. 22 In some cases, it is difficult to manually select descriptors useful for a particular property.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, descriptors for the QSAR of relatively small sets of related agents are not applicable for the analysis of chemical diversity. 22 The use of descriptors unrelated to a particular type of properties or biological activity likely generates noise in a statistical learning system, which may affect the prediction accuracy of that system. 22 In some cases, it is difficult to manually select descriptors useful for a particular property.…”
Section: Introductionmentioning
confidence: 99%
“…22 The use of descriptors unrelated to a particular type of properties or biological activity likely generates noise in a statistical learning system, which may affect the prediction accuracy of that system. 22 In some cases, it is difficult to manually select descriptors useful for a particular property. Thus methods capable of automatic selection of molecular descriptors are desirable.…”
Section: Introductionmentioning
confidence: 99%
“…Subsequently, a principal component analysis (PCA) study 23,24 was performed using 12 GCUT bidimensional molecular descriptors implemented in MOE 15 to represent the chemical space covered by the ligand of a given target family. Among these descriptors, three were found to be correlated, and were subsequently excluded.…”
Section: And Discussionmentioning
confidence: 99%
“…In this paper, we focus on similarity coefficients, in particular their use for measuring similarities between pairs of 2D fragment bit-strings (or fingerprints). These representations provide a very simple encoding of molecular structure but have been found to yield a surprisingly high level of performance in a range of similarity and diversity studies [2,[6][7][8][9][10].…”
Section: Introductionmentioning
confidence: 99%