2020
DOI: 10.1007/s41781-020-0036-1
|View full text |Cite
|
Sign up to set email alerts
|

High-Throughput Cloud Computing with the Cloudscheduler VM Provisioning Service

Abstract: We describe a high-throughput computing system for running jobs on public and private computing clouds using the HTCondor job scheduler and the cloudscheduler VM provisioning service. The distributed cloud computing system is designed to simultaneously use dedicated and opportunistic cloud resources at local and remote locations. It has been used for large-scale production particle physics workloads for many years using thousands of cores on three continents. A decade after its initial design and implementatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…In addition, local NFS server were setup to provide the software infrastructure and home directories. The High Energy Physics Research Computing (HEP-RC) 6 group at UVic uses cloudscheduler to manage VMs on Openstack clouds on demand depending on jobs in an HTCondor queue [6] [7]. To make use of the same system, HTCondor was adapted as batch system and wrapper scripts written to translate LSF/TORQUE/MAUI commands to HTCondor commands.…”
Section: Preservation Of the Analysis Frameworkmentioning
confidence: 99%
“…In addition, local NFS server were setup to provide the software infrastructure and home directories. The High Energy Physics Research Computing (HEP-RC) 6 group at UVic uses cloudscheduler to manage VMs on Openstack clouds on demand depending on jobs in an HTCondor queue [6] [7]. To make use of the same system, HTCondor was adapted as batch system and wrapper scripts written to translate LSF/TORQUE/MAUI commands to HTCondor commands.…”
Section: Preservation Of the Analysis Frameworkmentioning
confidence: 99%
“…While integrating opportunistic resources into an existing OBS makes resources directly accessible to users, it also restricts access to the specific group owning the OBS. For example, the meta-scheduler Cloud Scheduler [4] directly integrates resources into the distributed job submission infrastructure of specific collaborations.…”
Section: Single Point Of Entry and The Intermediate Obsmentioning
confidence: 99%
“…We use MinIO [10] to provide the object storage on single VM instances started on different clouds, each with storage space at the order of 100GB, as well as for a distributed multi-host cluster with a storage capacity of about 45TB. The most used data (in general files needed for most Figure 2: Cloudscheduler schema, Cloudscheduler [2] polls the job scheduler for job requirements and starts/stops VMs on attached clouds when needed depending on those requirements. simulation jobs) is replicated manually to the different MinIO instances and then instantly available via Dynafed as an additional copy.…”
Section: Uvic Instance For Belle-iimentioning
confidence: 99%
“…Instead of using static bare-metal worker nodes, our worker nodes are VMs started dynamically on distributed clouds depending on the resource requirements of a job. The management of the VMs and cloud resources is done by Cloudscheduler [2]. Fig.…”
Section: Introductionmentioning
confidence: 99%