2012
DOI: 10.1093/bioinformatics/bts372
|View full text |Cite
|
Sign up to set email alerts
|

An approach to describing and analysing bulk biological annotation quality: a case study using UniProtKB

Abstract: Motivation: Annotations are a key feature of many biological databases, used to convey our knowledge of a sequence to the reader. Ideally, annotations are curated manually, however manual curation is costly, time consuming and requires expert knowledge and training. Given these issues and the exponential increase of data, many databases implement automated annotation pipelines in an attempt to avoid un-annotated entries. Both manual and automated annotations vary in quality between databases and annotators, ma… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 31 publications
0
14
0
Order By: Relevance
“…casualty numbers in armed conflicts (Bohorquez, Gourley, Dixon, Spagat, and Johnson 2009;Friedman 2015), Figure 1b; comparing manually curated databases with automatically curated biological databases (Bell, Gillespie, Swan, and Lord 2012), Figure 1c.…”
Section: Wordsmentioning
confidence: 99%
“…casualty numbers in armed conflicts (Bohorquez, Gourley, Dixon, Spagat, and Johnson 2009;Friedman 2015), Figure 1b; comparing manually curated databases with automatically curated biological databases (Bell, Gillespie, Swan, and Lord 2012), Figure 1c.…”
Section: Wordsmentioning
confidence: 99%
“…Despite the large number of biological and medical databases, there are a few papers focused on their quality, assessment, and control. There are some exceptions, for example, in controlling data deposition quality in the PRIDE proteomics repository [Csordas et al, 2012], and developing methods for assessing annotation quality at UniProtKB [Bell et al, 2012]. Otherwise, databases such as ClinVar [Landrum et al, 2014] typically include discussion of validation and standardization in more general papers.…”
Section: Introductionmentioning
confidence: 99%
“…The work described here shows the value and importance of historical records, and that this value is also relevant to the present. We have previously made extensive use of historical records when looking at trends in database word usage ( Bell et al , 2012 ), as have others to determine when a database might be complete ( Baumgartner et al , 2007 ), or to assay the accuracy of predictive tools ( Gross et al , 2009 ). These analyses have dealt with both the structured (GO) and unstructured (comments) components of annotation.…”
Section: Discussionmentioning
confidence: 99%