2010
DOI: 10.1109/tpds.2009.49
|View full text |Cite
|
Sign up to set email alerts
|

All-Pairs: An Abstraction for Data-Intensive Computing on Campus Grids

Abstract: Abstract-Today, campus grids provide users with easy access to thousands of CPUs. However, it is not always easy for nonexpert users to harness these systems effectively. A large workload composed in what seems to be the obvious way by a naive user may accidentally abuse shared resources and achieve very poor performance. To address this problem, we argue that campus grids should provide end users with high-level abstractions that allow for the easy expression and efficient execution of data intensive workload… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0
1

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 59 publications
(42 citation statements)
references
References 30 publications
(33 reference statements)
0
41
0
1
Order By: Relevance
“…All-pairs is presented by Moretti et al [15] for data-intensive computing on campus grids. It provides an abstraction to users to deal with All-to-All Comparison Problems.…”
Section: Related Work and Motivationsmentioning
confidence: 99%
See 1 more Smart Citation
“…All-pairs is presented by Moretti et al [15] for data-intensive computing on campus grids. It provides an abstraction to users to deal with All-to-All Comparison Problems.…”
Section: Related Work and Motivationsmentioning
confidence: 99%
“…Among the existing frameworks, Hadoop [14] is popularly used to support the MapReduce computation pattern, but is inefficient in processing of All-to-All Comparison Problems. All-pairs [15] is designed for All-to-All Comparison Problems in a Campus Grid, but its application range is limited due to its brute-force data storage strategy, which stores all the data on all the worker nodes.…”
Section: Introductionmentioning
confidence: 99%
“…First, MapReduce only handles one-dimensional input and hence is not suitable for implementing both query segmentation and database segmentation approaches. Moretti et al has reported a similar observation that MapReduce is not sufficient to express all-to-all style computation [43]. The existing MapReduce BLAST implementation, i.e., CloudBLAST [44], only implements query segmentation and stores the entire database on each node.…”
Section: Mapreducementioning
confidence: 99%
“…Porting of ccc-gistemp to other scalable systems intended for data-intensive computing such as Dryad [12], All-Pairs [17] and Pregel [16] would provide a comparative study of the various programming abstractions that are suitable. Likewise implementations of MapReduce which use existing highperformance shared filesystems are now available (e.g.…”
Section: Further Workmentioning
confidence: 99%