2008
DOI: 10.1007/978-3-540-88871-0_54

Managing Very-Large Distributed Datasets

Abstract: In this paper, we introduce a system for handling very large datasets, which need to be stored across multiple computing sites. Data distribution introduces complex management issues, particularly as computing sites may make use of different storage systems with different internal organizations. The motivation for our work is the ATLAS Experiment for the Large Hadron Collider (LHC) at CERN, where the authors are involved in developing the data management middleware. This middleware, called DQ2, is ch…

Citations: cited by 4 publications (4 citation statements)
References: 18 publications
“…Our previous work investigated the system behaviour of DQ2 and we found a direct dependence of concurrent bursts on global performance [20], [21]. Unfortunately, it is difficult to effectively manage such bursts in the system [8], due to the heterogeneity of involved services. Furthermore, the principles of burst identification and prediction are likely to assist modelling of arbitrary types of workloads, respectively any time series [12], [31].…”
Section: Introduction (mentioning)
confidence: 98%
“…To date, DQ2 is considered one of the largest open data management environments ever built and an example of a global multigrid / cloud hybrid system [9]. Our previous work investigated the system behaviour of DQ2 and we found a direct dependence of concurrent bursts on global performance [20], [21].…”
Section: Introduction (mentioning)
confidence: 99%
“…U.S. Open Science Grid (OSG), Tera Grid, and the EU's European grid (Euro Grid) focus on computing. American physical grid (GriPhyN) [4] and the Euro Data Grid are typical data grid [5][6][7]. Grid technology has also been in a rapid development in China.…”
Section: Introduction (mentioning)
confidence: 99%