2007
DOI: 10.1093/bioinformatics/btm580
|View full text |Cite
|
Sign up to set email alerts
|

Genome scale enzyme–metabolite and drug–target interaction predictions using the signature molecular descriptor

Abstract: Motivation: Identifying protein enzymatic or pharmacological activities are important areas of research in biology and chemistry. Biological and chemical databases are increasingly being populated with linkages between protein sequences and chemical structures.There is now sufficient information to apply machine-learning techniques to predict interactions between chemicals and proteins at a genome scale. Current machine-learning techniques use as input either protein sequences and structures or chemical inform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
128
0
1

Year Published

2009
2009
2019
2019

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 151 publications
(129 citation statements)
references
References 33 publications
0
128
0
1
Order By: Relevance
“…One family we discard since it has too few proteins and interactions. For the proteins we extracted features using the signature molecular descriptors [13], for the chemicals we used a frequent subgraph feature representation approach [20,34], and we used a threshold on the feature frequencies to obtain about 100 features each. We then built the feature vector for a given protein-chemical pair by taking the tensor product between the protein and chemical feature vectors.…”
Section: Protein-chemical Interaction (Data Sets 13 -24)mentioning
confidence: 99%
“…One family we discard since it has too few proteins and interactions. For the proteins we extracted features using the signature molecular descriptors [13], for the chemicals we used a frequent subgraph feature representation approach [20,34], and we used a threshold on the feature frequencies to obtain about 100 features each. We then built the feature vector for a given protein-chemical pair by taking the tensor product between the protein and chemical feature vectors.…”
Section: Protein-chemical Interaction (Data Sets 13 -24)mentioning
confidence: 99%
“…Molecular signatures 41 (MS) are graph-based descriptors that encodes the "neighborhood" of each atom of a molecule, similarly to Morgan's or ECFP fingerprints. Each kind of "neighborhood", or atom environment, is a feature of MS. A reaction's signature (RS) is computed by subtracting the MS of the substrates to the MS of the products of the reaction, 24 and takes the general form…”
Section: ■ Discussionmentioning
confidence: 99%
“…The assay depositor reported a compound as active if IC 50 < 50 µM was obtained in all three IC 50 determinations, inconclusive if IC 50 < 50 µM in only one or two determinations, and inactive for IC 50 > 50 µM. 51 In this work, we reduced the active classification to 5 µM resulting in 47 active and 68 inactive compounds.…”
Section: Training and Test Setsmentioning
confidence: 99%
“…In our work the input vectors used in the SVM are atomic Signatures. Note that Signature has previously been used with a SVM to predict both protein-protein 49 and drug-target 50 interactions. Note that the number of descriptors (referred to as features when used in statistical learning methods) compared to the number of observations is an important consideration to avoid overfitting.…”
Section: Introductionmentioning
confidence: 99%