2013
DOI: 10.1371/journal.pbio.1001638
|View full text |Cite
|
Sign up to set email alerts
|

The COMBREX Project: Design, Methodology, and Initial Results

Abstract: Experimental data exists for only a vanishingly small fraction of sequenced microbial genes. This community page discusses the progress made by the COMBREX project to address this important issue using both computational and experimental resources.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
59
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 55 publications
(59 citation statements)
references
References 31 publications
0
59
0
Order By: Relevance
“…Recent advances in the efficiency and speed of gene sequencing coupled to a comparatively much slower pace of acquiring experimental evidence for protein function have given rise to a large number of hypothetical or erroneously annotated proteins in the databases (1,2). An example of a family of enzymes with a sizable amount of hypothetical proteins in the database is the nitronate monooxygenases (NMOs), with over 5000 genes in the GenBank TM currently annotated as NMO.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent advances in the efficiency and speed of gene sequencing coupled to a comparatively much slower pace of acquiring experimental evidence for protein function have given rise to a large number of hypothetical or erroneously annotated proteins in the databases (1,2). An example of a family of enzymes with a sizable amount of hypothetical proteins in the database is the nitronate monooxygenases (NMOs), with over 5000 genes in the GenBank TM currently annotated as NMO.…”
Section: Discussionmentioning
confidence: 99%
“…The discrepancy between the rapid increase in the number of sequenced genomes of prokaryotes and the slower experimental determination of protein function has resulted in the presence of a large number of hypothetical proteins in the databases, with gene function prediction often unreliable (1,2). One case of an enzyme family consisting mainly of hypothetical proteins is represented by the nitronate monooxygenases (NMOs, 2 EC 1.13.12.16), which includes Ͼ5000 genes in the GenBank TM .…”
mentioning
confidence: 99%
“…Over 99 % of all annotations are created in this manner, and they are applied to approximately 76 % of all genes [ 6 ]-the remaining 24 % of genes typically have no annotation or are listed as "hypothetical protein." With the exponential growth of biological databases and the labor-intensive nature of manual curation, it is inevitable that automated computational predictions will provide the vast majority of annotations populating current and future databases.…”
Section: Sources Of Gene Ontology Annotations: Curated and Computatiomentioning
confidence: 99%
“…One such attempt at community building is focused on bacterial proteins: COMBREX ( COM putational BR idge to Ex periments), along with additional efforts such as the Enzyme Function Initiative [ 19 ]. The database ( http://combrex.bu.edu ) classifi es the gene function status of 3.3 million bacterial genes, including 13,665 proteins that have experimentally determined functions [ 6 ]. The database contains traceable statements to experimentally characterized proteins, thereby providing support for a given annotation in a clear and transparent manner.…”
Section: Approaches To Test Computational Predictions With Experimentmentioning
confidence: 99%
See 1 more Smart Citation