2014
DOI: 10.1186/1471-2105-15-87
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of protein function using template-based alignment of enzyme active sites

Abstract: BackgroundThe accumulation of protein structural data occurs more rapidly than it can be characterized by traditional laboratory means. This has motivated widespread efforts to predict enzyme function computationally. The most useful/accurate strategies employed to date are based on the detection of motifs in novel structures that correspond to a specific function. Functional residues are critical components of predictively useful motifs. We have implemented a novel method, to complement current approaches, wh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 11 publications
(15 citation statements)
references
References 28 publications
(35 reference statements)
0
14
1
Order By: Relevance
“…The two motif sets based on the same set of native structures but generated with different methods exhibited similar hit rates against homologous structures (70.25% and 71.59% for the P Set and the AP Set, respectively) and against unrelated structures (3.79% and 4.32% for the P Set and the AP Set, respectively). The true positive rate for both sets of motifs is different from the 62.63% value reported previously for the full P set of 181 motifs [1]. The differences result from the following changes in the matching algorithm: (1) the use of a more restricted definition of a true positive to be one with a Levenshtein distances ≤ 1, rather than accepting Levenshtein distances of 2 and 3 as in the prior results, (2) the use in the present instance of a smaller set of the most reliable 112 motifs for which positive results with Levenshtein distances ≤ 1 could be achieved, and (3) restricting the definition of a homolog to be a molecule agreeing in both EC number and Pfam family, rather than the EC number alone.…”
Section: Resultscontrasting
confidence: 80%
See 2 more Smart Citations
“…The two motif sets based on the same set of native structures but generated with different methods exhibited similar hit rates against homologous structures (70.25% and 71.59% for the P Set and the AP Set, respectively) and against unrelated structures (3.79% and 4.32% for the P Set and the AP Set, respectively). The true positive rate for both sets of motifs is different from the 62.63% value reported previously for the full P set of 181 motifs [1]. The differences result from the following changes in the matching algorithm: (1) the use of a more restricted definition of a true positive to be one with a Levenshtein distances ≤ 1, rather than accepting Levenshtein distances of 2 and 3 as in the prior results, (2) the use in the present instance of a smaller set of the most reliable 112 motifs for which positive results with Levenshtein distances ≤ 1 could be achieved, and (3) restricting the definition of a homolog to be a molecule agreeing in both EC number and Pfam family, rather than the EC number alone.…”
Section: Resultscontrasting
confidence: 80%
“…As part of the Structural Biology Extensible Visualization Scripting Language (SBEVSL) initiative, ProMOL [1] is a plugin for the molecular graphics system PyMOL [2]. ProMOL is written in Python 2.7 and utilizes a template-based function prediction method in an attempt to classify proteins of known structure but unknown function [1].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Structures were selected based on the availability of literature definitions of active sites in the Catalytic Site Atlas [15]. Two sets of motif templates make up the current library: roughly half were generated manually with the Motif Maker tool in ProMOL [11], which can be used to create motif templates based on residue name, residue number and chain identifier for the residues in the active site of an enzyme structure found in the PDB. The remaining motif templates were generated automatically with a script that tests the motif template against itself, homologs and random PDB structures to provide sensitivity and specificity data for each new motif template [manuscript in preparation].…”
Section: Methodsmentioning
confidence: 99%
“…We have developed the ProMOL [11] plugin for PyMOL [12], a tool used to explore the catalytic site structural homologies between proteins of known function and those for which functions are not yet known. ProMOL uses template-based alignment of these structures with a current library of 388 active site motifs as reported in the Catalytic Site Atlas [13].…”
Section: Introductionmentioning
confidence: 99%