SC14: International Conference for High Performance Computing, Networking, Storage and Analysis 2014
DOI: 10.1109/sc.2014.42
|View full text |Cite
|
Sign up to set email alerts
|

Orion: Scaling Genomic Sequence Matching with Fine-Grained Parallelization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 21 publications
0
16
0
Order By: Relevance
“…Massive volumes of sequences maintained in the database to be searched induces additional computation burden. BLAST is a widely adopted bioinformatics tool for sequence alignment which perform faster alignments, at expense of accuracy (possibly missing some potential hits) [18,19]. Experiment are conducted to evaluate MOHMR and HMR performance for performing gene sequence alignment.…”
Section: Bioinformatics Application Performance Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…Massive volumes of sequences maintained in the database to be searched induces additional computation burden. BLAST is a widely adopted bioinformatics tool for sequence alignment which perform faster alignments, at expense of accuracy (possibly missing some potential hits) [18,19]. Experiment are conducted to evaluate MOHMR and HMR performance for performing gene sequence alignment.…”
Section: Bioinformatics Application Performance Evaluationmentioning
confidence: 99%
“…The dataset for experiment analysis is obtained from NCBI [20]. For performing alignment Drosophila database as a reference database and Query sequence of varied sizes of from Homo sapiens chromosomal sequences and genomic scaffolds is considered similar to [19] which are tabulated in Table 1. All four experiment are conducted using BLAST algorithm on HMR and MOHMR frameworks.…”
Section: Bioinformatics Application Performance Evaluationmentioning
confidence: 99%
“…The resulting alignments can then be combined using a custom aggregation function that eliminates duplicates and splices together overlapping alignments. We use the strategy proposed by Mahadik et al [26] to implement the partition and aggregate functions. Figure 11 provides the pseudo code for partition and aggregate functions.…”
Section: Partition-aggregate Functionsmentioning
confidence: 99%
“…As the amount of genomic data rapidly increases, many applications in these categories are facing severe challenges in scaling up to handle these large datasets. For example, the parallel version of the de facto local sequence alignment application called BLAST suffers from an exponential increase in matching time with the increasing size of the query sequence-the knee of the curve is reached at a query size of only 2 Mega base pairs (Mbp) ( Figure 3 in [26]). For calibration, the smallest human chromosome (chromosome 22) is 49 Mbp long.…”
Section: Introductionmentioning
confidence: 99%
“…However, in this approach the scaling of the index table construction is limited by the size of each subgroup. Orion [35] is an improvement over mpiBLAST [36] and scales the sequence matching with fine-grained parallelization. However, Orion uses mpiBLAST's mpiformatdb tool to format and to shard the database and this process is serial.…”
Section: Related Workmentioning
confidence: 99%