2007
DOI: 10.1073/pnas.0611678104

Growth of novel protein structural data

Abstract: Contrary to popular assumption, the rate of growth of structural data has slowed, and the Protein Data Bank (PDB) has not been growing exponentially since 1995. Reaching such a dramatic conclusion requires careful measurement of growth of novel structures, which can be achieved by clustering entry sequences, or by using a novel index to down-weight entries with a higher number of sequence neighbors. These measures agree, and growth rates are very similar for entire PDB files, clusters, and weighted chains.

Cited by 145 publications (98 citation statements)
References 44 publications
“…Models within this threshold distance from the ith model are summed to give $\mathrm{Neighbors}_i$; an initial estimate for $k = \sum_i 1/(\mathrm{Neighbors}_i + 1)$ (45). If the initial estimate is too small, clusters would be merged, yielding less insightful interpretations.…”
Section: Methods
mentioning
confidence: 99%
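The estimate quoted in this statement sums, over all models, the reciprocal of one plus each model's neighbor count. A minimal sketch of that arithmetic follows. It assumes a precomputed symmetric distance matrix, and it assumes "within" means less than or equal to the threshold. The function name `estimate_cluster_count` and the toy data are hypothetical and do not come from the citing paper.

```python
from typing import Sequence


def estimate_cluster_count(
    distances: Sequence[Sequence[float]], threshold: float
) -> float:
    """Return k = sum_i 1 / (Neighbors_i + 1).

    Neighbors_i is the number of models other than i whose distance to
    model i is at most `threshold` (inclusive cutoff is an assumption).
    """
    n = len(distances)
    k = 0.0
    for i in range(n):
        # Count the models within the threshold distance of model i,
        # excluding model i itself.
        neighbors_i = sum(
            1 for j in range(n) if j != i and distances[i][j] <= threshold
        )
        # A model with many neighbors contributes less to the estimate.
        k += 1.0 / (neighbors_i + 1)
    return k


# Toy data (hypothetical): five models in two tight groups.
# Distance is 1.0 within a group and 10.0 between groups.
groups = [0, 0, 0, 1, 1]
example_distances = [
    [0.0 if i == j else (1.0 if gi == gj else 10.0) for j, gj in enumerate(groups)]
    for i, gi in enumerate(groups)
]

# With a threshold of 2.0, each group of size m contributes m * (1/m) = 1,
# so the estimate is approximately 2.
print(round(estimate_cluster_count(example_distances, 2.0), 6))
```

When every group is fully connected under the threshold, each group contributes exactly one to the sum, so the estimate equals the number of groups. A threshold so small that no model has neighbors gives k equal to the number of models, and a threshold that links every model gives k = 1.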
“…Homology or template based modeling has been the most successful method for protein structure prediction in the critical assessment of protein structure prediction (CASP) experiments (6,7). The power of this technique progressively increases as ever more structures are solved by world-wide structural genomics initiatives (8,9). Nevertheless, obtaining a model with the same accuracy as a crystal structure is still an unsolved problem: structure refinement of a rough model (within 1-3 Å rmsd) to bring it closer to the native structure remains a major challenge (6,10).…”
mentioning
confidence: 99%
“…Commercial software for homology modelling is widely available. Approximately 1,000 protein folds have been discovered so far; Levitt [15] predicts that the maximum number of folds is likely to be 1,613.…”
Section: Homology Methods
mentioning
confidence: 99%