2009
DOI: 10.1016/j.sbi.2009.02.001
|View full text |Cite
|
Sign up to set email alerts
|

Is protein classification necessary? Toward alternative approaches to function annotation

Abstract: The current non-redundant protein sequence database contains over seven million entries and the number of individual functional domains is significantly larger than this value. The vast quantity of data associated with these proteins poses enormous challenges to any attempt at function annotation. Classification of proteins into sequence and structural groups has been widely used as an approach to simplifying the problem. In this article we question such strategies. We describe how the multifunctionality and s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
40
0

Year Published

2010
2010
2015
2015

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 47 publications
(41 citation statements)
references
References 40 publications
1
40
0
Order By: Relevance
“…The question if protein space is continuous or discrete has been extensively debated (2, 3, 5-10), and is interesting both fundamentally and for its implications on how to organize and search protein databases (5,16). The domain network allows us to describe "continuous" and "discrete" more concretely based on the sizes and number of connected components.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The question if protein space is continuous or discrete has been extensively debated (2, 3, 5-10), and is interesting both fundamentally and for its implications on how to organize and search protein databases (5,16). The domain network allows us to describe "continuous" and "discrete" more concretely based on the sizes and number of connected components.…”
Section: Discussionmentioning
confidence: 99%
“…2,5,6,9,[13][14][15]. If such similarities are abundant, then one must account for them when organizing and searching proteins (5,8,16). In support of the abundance of such similarities is the remarkable success of structure prediction methods that piece together predictions of protein fragments or larger protein segments (e.g., ref.…”
mentioning
confidence: 99%
“…Although in the latter case one is typically restricted to studying sets of proteins of the same label (e.g., fold), maps display structural similarities among all proteins in a single representation. Indeed, it was previously noted that considering all structural similarities is advantageous for studying protein structure and function (18,19). The efficiency of our method renders the calculation of maps for the full Protein Data Bank (PDB) possible, as opposed to only a sparse sample from it.…”
Section: Discussionmentioning
confidence: 99%
“…[6][7][8]. One may view the collection of all possible tertiary structures as a protein structural space.…”
mentioning
confidence: 99%