2020
DOI: 10.1051/epjconf/202024505042
|View full text |Cite
|
Sign up to set email alerts
|

Raythena: a vertically integrated scheduler for ATLAS applications on heterogeneous distributed resources

Abstract: The ATLAS experiment has successfully integrated HighPerformance Computing resources (HPCs) in its production system. Unlike the current generation of HPC systems, and the LHC computing grid, the next generation of supercomputers is expected to be extremely heterogeneous in nature: different systems will have radically different architectures, and most of them will provide partitions optimized for different kinds of workloads. In this work we explore the applicability of concepts and tools realized in Ray (the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…A further challenge is to keep track of the progress of many independent applications processing small chunks of events, where the applications have the ability to stop work after any chunk has finished or use some other mechanism to deal with early termination. One way to achieve this goal is to integrate the application framework with a workflow management system as demonstrated by the Raythena/EventService project [14].…”
Section: Hpc Sitesmentioning
confidence: 99%
“…A further challenge is to keep track of the progress of many independent applications processing small chunks of events, where the applications have the ability to stop work after any chunk has finished or use some other mechanism to deal with early termination. One way to achieve this goal is to integrate the application framework with a workflow management system as demonstrated by the Raythena/EventService project [14].…”
Section: Hpc Sitesmentioning
confidence: 99%
“…As an intermediate step, we have implemented Raythena [5] a task farm managing multiple AthenaMP processes within a single HPC allocation. This layer of scheduling requires a lot of boilerplate components to stay compatible with the WLCG.…”
Section: Introductionmentioning
confidence: 99%