2004
DOI: 10.1110/ps.04634604

Sequence‐structure mapping errors in the PDB: OB‐fold domains

Abstract: The Protein Data Bank (PDB) is the single most important repository of structural data for proteins and other biologically relevant molecules. Therefore, it is critically important to keep the PDB data, as much as possible, error-free. In this study, we have analyzed PDB crystal structures possessing oligonucleotide/oligosaccharide binding (OB)-fold, one of the highly populated folds, for the presence of sequence-structure mapping errors. Using energy-based structure quality assessment coupled with sequence a…

Cited by 16 publications
(19 citation statements)
References 34 publications
“…Existing database curation policies must account for the potential for error propagation and incorporate standardization procedures that can correct for errors when they arise [129,130], for example by using ProsaII [131] to evaluate sequence-structure compatibility of PDB entries and identify errors [132]. While the type of crowdsourcing error correction exemplified by Venclovas et al can be helpful, we argue that it shouldn't be relied upon [132]; curators should preemptively establish policies to help identify, control, and prevent errors.…”
Section: Challenge: Address the Inconsistent Quality Of Existing Data (mentioning)
confidence: 95%
“…4 One hairpin loop and two flexible long loops extend out from the central β-barrel core. [4][5][6] The C-terminal region of the SSB consisting of numerous acidic residues is highly disordered [7][8][9] and has been proposed to interact with auxiliary proteins and influence their binding mode to ssDNA. 10,11 Most bacterial SSBs and the human mitochondrial SSB 12 are active as a homotetramer where four OB-folds function cooperatively to bind ssDNA.…”
Section: Introduction (mentioning)
confidence: 99%
“…This will not only help community to leverage this information in new pathway designs, but also increase the visibility of the published results themselves. The Protein Data Bank, a database containing standardized information on macromolecular crystal structures, is an excellent example of how standardization and centralized storage promote re-use of data, but also large-scale correction or refinement 16,17 . Even if no fulltime salaried curators are available, this could still be done by crowdsourcing community involvement in curation/annotation hackathons or even by organizing large-scale and parallelized annotation efforts by undergraduate students 18 .…”
Section: Standardization For Natural Product Biosynthesis (mentioning)
confidence: 99%