2019
DOI: 10.1051/epjconf/201921404024

Architecture and prototype of a WLCG data lake for HL-LHC

Abstract: The computing strategy document for HL-LHC identifies storage as one of the main WLCG challenges a decade from now. Under the naive assumption that today's computing model still applies, the ATLAS and CMS experiments will need an order of magnitude more storage than funding agencies could realistically provide at today's cost. The evolution of the computing facilities and the way storage will be organized and consolidated will play a key role in how this possible shortage of res…

Citations: cited by 23 publications (25 citation statements)
References: 5 publications

“…In addition, the usual approaches that work for configuration of compute might not work for network infrastructure due to various reasons. • Data Center Interconnect technologies (both HW and software-based) are quite novel approaches that will require initial deployments to evaluate how they could benefit inter-DC networking for federated use case such as data lakes [8]. • A range of other approaches that are only mentioned briefly in the report such as GPU, virtualized storage, hyper-converged architectures and edge services for HEP instrumentation and experiments will require initial testbeds and evaluations.…”
Section: Cloud Native Data Centre Network
Type: mentioning
confidence: 99%
“…A cluster of such centres can then create a federated site that will be exposed behind a single endpoint/interface for the experiments. This transformation has already started within the WLCG data lakes activities and some of the participating sites are already running their storage and or compute in a federated setup [8].…”
Section: Programmable Network
Type: mentioning
confidence: 99%
“…Though AGLT2 and NDGF sites operate over a decade, such distributed setups are exceptions. However, to reduce operational overhead, more and more small Tier-2s will come together to build so-called data lake [7].…”
Section: Federated Systems
Type: mentioning
confidence: 99%
“…from supporting only purely CPU-bound workflows, we are now able to support a variety of workflows with emphasis on different type of resource and different level of CPU-, memory-, and I/O-intensity. We can leverage an enriched variety of workflows to setup a testbed for WLCG Data Organization, Management, and Access [10] performance studies, to test ideas for the HL-LHC [11].…”
Section: Hammercloud Architecture and Evolution
Type: mentioning
confidence: 99%