2016
DOI: 10.1186/s13059-016-1037-6
|View full text |Cite
|
Sign up to set email alerts
|

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Abstract: BackgroundA major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.ResultsWe conducted the second critical assessment of functional an… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

6
478
0
5

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 371 publications
(494 citation statements)
references
References 19 publications
6
478
0
5
Order By: Relevance
“…Functional annotations relating to musculoskeletal disease, especially OA, are poor and result in spurious descriptors. These important issues have been realized8 and methods to improve annotations are being developed 9…”
Section: Biology As a Systemmentioning
confidence: 99%
“…Functional annotations relating to musculoskeletal disease, especially OA, are poor and result in spurious descriptors. These important issues have been realized8 and methods to improve annotations are being developed 9…”
Section: Biology As a Systemmentioning
confidence: 99%
“…These algorithms are recognized as powerful alternative method for the functional prediction of both proteins [66][67][68][69][70] and other biomolecules [71]. However, over one third of the protein sequences in the UniProt [26] are still labeled as "putative", "uncharacterized", "unknown function" or "hypothetical", and the difficulty in discovering the functional class of the remaining proteins are reported to come from the false discovery rate of the in-silico methods [55,56,72]. Moreover, the identification accuracies of those approaches still need to be further improved [55,56,73].…”
Section: Introductionmentioning
confidence: 99%
“…However, over one third of the protein sequences in the UniProt [26] are still labeled as "putative", "uncharacterized", "unknown function" or "hypothetical", and the difficulty in discovering the functional class of the remaining proteins are reported to come from the false discovery rate of the in-silico methods [55,56,72]. Moreover, the identification accuracies of those approaches still need to be further improved [55,56,73]. Thus, it is urgently necessary to assess the identification accuracies and false discovery rates among those different in-silico approaches.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…ML approaches are the state of the art in most non-classic prediction challenges. These methods are applied in community annotation challenges such as Critical Assessment of protein Function Annotation (CAFA) (5,6), and Critical Assessment for Information Extraction in Biology (BioCreAtIvE) (7). ML approaches actually benefit from the growth of available sequences, while 'brittle' rulebased methods often fail to cope with the growing variability and quantity of possible annotations and sequences.…”
Section: Introductionmentioning
confidence: 99%