Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06) 2006
DOI: 10.1109/ccgrid.2006.1630934
|View full text |Cite
|
Sign up to set email alerts
|

Using multiple grid resources for bioinformatics applications in GADU

Abstract: During the past decade, the scientific community has witnessed the rapid accumulation of gene sequence data and data related to physiology and biochemistry of organisms. Bioinformatics tools used for efficient and computationally intensive analysis of genetic sequences require large-scale computational resources to accommodate the growing data. Grid computational resources such as the Open Science Grid and TeraGrid have proved useful for scientific discovery. GADU is a high-throughput computational system deve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2006
2006
2014
2014

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…The Genome Analysis and Database Update system (GADU) provides an automated, scalable, high-throughput computational workflow engine that executes bioinformatics tools (BLAST, BLOCKS, PFam, Chisel and InterPro) with public databases (NCBI RefSeq, PIR, InterPro and KEGG) on multiple Grids of different architectures and environment, a collective member of more than 18,000 CPUs contributed by more than 60 institutions [31]. …”
Section: Computing Gridmentioning
confidence: 99%
“…The Genome Analysis and Database Update system (GADU) provides an automated, scalable, high-throughput computational workflow engine that executes bioinformatics tools (BLAST, BLOCKS, PFam, Chisel and InterPro) with public databases (NCBI RefSeq, PIR, InterPro and KEGG) on multiple Grids of different architectures and environment, a collective member of more than 18,000 CPUs contributed by more than 60 institutions [31]. …”
Section: Computing Gridmentioning
confidence: 99%
“…Many GRID BLAST implementations have been developed and reported [25][26][27][28][29][30]. The characteristics of Grid Blast are summarized as follows: (i) prestag-ing of sequence databases to minimize the runtime overhead of transferal of large sequence databases, which often reach several Gigabytes in size, (ii) databases update, which keeps data consistency on the data grid, (iii) dynamic load balancing of query sequences to avoid unexpected slow responses, especially when dealing with thousands of query sequences in heterogeneous computation pools including PC-clusters and desktop computers, and (iv) assembling of the results from distributed jobs.…”
Section: Homology Searchmentioning
confidence: 99%