2014
DOI: 10.1002/cpe.3264
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing high performance computing workflow for protein functional annotation

Abstract: Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation infeasible, whereas a high compute cost limits the utility of existing automated approaches. In this work, we present an improved… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…The paper on functional annotation of newly sequenced genomes describes an optimized workflow to enable large‐scale protein annotation. It utilizes a special classification algorithm and high performance computing (HPC), and the demonstrated results show capabilities which scientists will be able to utilize to annotate big genome data.…”
Section: Science and Engineering Trackmentioning
confidence: 99%
“…The paper on functional annotation of newly sequenced genomes describes an optimized workflow to enable large‐scale protein annotation. It utilizes a special classification algorithm and high performance computing (HPC), and the demonstrated results show capabilities which scientists will be able to utilize to annotate big genome data.…”
Section: Science and Engineering Trackmentioning
confidence: 99%
“…The large volume of sequencing data that are now available has created profound challenges in data transfer and analysis [ 3 ]. High throughput computing on supercomputers was recently introduced to meet these challenges [ 4 , 5 ]. However, high performance computing can be costly, and access to a supercomputing facility can be limited for small laboratories.…”
Section: Introductionmentioning
confidence: 99%