2011
DOI: 10.1016/j.drudis.2011.10.005
|View full text |Cite
|
Sign up to set email alerts
|

Making every SAR point count: the development of Chemistry Connect for the large-scale integration of structure and bioactivity data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
82
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 72 publications
(82 citation statements)
references
References 23 publications
0
82
0
Order By: Relevance
“…1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 16 To create (Q)SAR models, we used the program GUSAR. 17,18 A combination of three types of descriptors is used in GUSAR: (1) QNA (Quantitative Neighborhoods of Atoms) descriptors, (2) descriptors that are both topological (length and volume) and physicochemical parameters of the whole molecule, and (3) descriptors based on the prediction of the biological activity spectra by the program PASS. 19,20 The robustness of the GUSAR algorithm vis-à-vis the utilization of data sets that are non-homogeneous (in terms of chemical similarity) has been shown earlier.…”
Section: Modeling Set Preparation From Chembl Datamentioning
confidence: 99%
“…1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 16 To create (Q)SAR models, we used the program GUSAR. 17,18 A combination of three types of descriptors is used in GUSAR: (1) QNA (Quantitative Neighborhoods of Atoms) descriptors, (2) descriptors that are both topological (length and volume) and physicochemical parameters of the whole molecule, and (3) descriptors based on the prediction of the biological activity spectra by the program PASS. 19,20 The robustness of the GUSAR algorithm vis-à-vis the utilization of data sets that are non-homogeneous (in terms of chemical similarity) has been shown earlier.…”
Section: Modeling Set Preparation From Chembl Datamentioning
confidence: 99%
“…Advances in screening technologies and an increasing awareness of the value of biological data utilization have led to the accumulation of diverse small molecule bioactivity data in public and corporate repositories [1,2]. The wealth of information ranges from quantitative (such as gene expression, cellular phenotypes and protein phosphorylation) to categorical (such as clinical adverse events) data.…”
Section: Introductionmentioning
confidence: 99%
“…Compound structures were standardized according to AstraZeneca rules 13 and used for compound identification and comparison of compounds tested in both internal and external HTS assays. 4 Compound activity between primary HTS assays screened on the same human-derived protein target (defined by gene symbol) was analyzed using compound activity overlap A 1,2 /(A 1 + A 2 + A 1,2 ), where A 1,2 is the number of compounds active in both assay 1 and assay 2, and A 1 and A 2 are the number of compounds only active in one of the assays and inactive in the other. Where more than two assays were evaluated for the same target, the compound activity was analyzed both in pairs as above and by evaluating the compound activity overlap (also calculated in assay pairs according to the equation above) using only compounds tested in all three assays.…”
Section: Compound Activity Analysismentioning
confidence: 99%
“…There have been both proprietary and open public initiatives to improve the integration of HTS data from different sources. One example of a proprietary solution is the ChemistryConnect application, 4 while Open PHACTS (http://openphacts.org) and BARD (http://bard. nih.gov) are examples of two major publicly funded initiatives for better integration of screening data.…”
Section: Introductionmentioning
confidence: 99%