2020
DOI: 10.1051/epjconf/202024503010
|View full text |Cite
|
Sign up to set email alerts
|

Managing the ATLAS Grid through Harvester

Abstract: ATLAS Computing Management has identified the migration of all computing resources to Harvester, PanDA’s new workload submission engine, as a critical milestone for LHC Run 3 and 4. This contribution will focus on the Grid migration to Harvester. We have built a redundant architecture based on CERN IT’s common offerings (e.g. Openstack Virtual Machines and Database on Demand) to run the necessary Harvester and HTCondor services, capable of sustaining the load of O(1M) workers on the Grid per day. We have revie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 5 publications
0
4
0
Order By: Relevance
“…We assume that for each experiment an HTCondor server is available and that the experiments' computing systems are already configured to send jobs to a specific HTCondor system, for example ATLAS submits via Harvester [12] and Belle-II via a DIRAC site director [13].…”
Section: Csvas a Wlcg Compute Sitementioning
confidence: 99%
“…We assume that for each experiment an HTCondor server is available and that the experiments' computing systems are already configured to send jobs to a specific HTCondor system, for example ATLAS submits via Harvester [12] and Belle-II via a DIRAC site director [13].…”
Section: Csvas a Wlcg Compute Sitementioning
confidence: 99%
“…To mitigate this problem, it was decided to use Elasticsearch, Logstash and Kibana (ELK) stack for global monitoring of Harvester. The data is copied every 10 minutes from the PanDA DB to the central Elasticsearch storage provided by CERN IT [11]. The implemented views of the Harvester monitoring module are mostly used for debugging immediate problems related to a particular worker or PanDA job because BigPanDA monitor reads data directly from the PanDA database without additional delays.…”
Section: Harvester Monitoring Modulementioning
confidence: 99%
“…The workload management system PanDA is used to process, broker and dispatch all ATLAS workloads across the heterogeneous WLCG and opportunistic resources. The interface to the resources is handled by Harvester [14], which was implemented following a flexible plugin approach so that it can be easily expanded for any available resource. During the first proof of concept phase with Google in 2017 [15] the ATLAS payloads on Google Compute Engine (GCE) had to be integrated in the most native and lightweight way possible.…”
Section: Integration Of Kubernetes With the Atlas Workload Management Systemmentioning
confidence: 99%