Proceedings of the Tenth European Conference on Computer Systems 2015
DOI: 10.1145/2741948.2741964
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale cluster management at Google with Borg

Abstract: Google's Borg system is a cluster manager that runs hundreds of thousands of jobs, from many thousands of different applications, across a number of clusters each with up to tens of thousands of machines.It achieves high utilization by combining admission control, efficient task-packing, over-commitment, and machine sharing with process-level performance isolation. It supports high-availability applications with runtime features that minimize fault-recovery time, and scheduling policies that reduce the probabi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
606
0
7

Year Published

2016
2016
2021
2021

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 987 publications
(614 citation statements)
references
References 54 publications
1
606
0
7
Order By: Relevance
“…In actual practice the cloud infrastructure is the enabler of these essential characteristics. Due to the economy of scale, infrastructure installations are large and typically built of commodity hardware so that failures are the norm rather than the exception [Verma et al (2015)]. Finally, cloud applications often rely on third-party services, as part of the application functionality, support (e.g., monitoring) or both.…”
Section: Cna: Definitions and Requirementsmentioning
confidence: 99%
“…In actual practice the cloud infrastructure is the enabler of these essential characteristics. Due to the economy of scale, infrastructure installations are large and typically built of commodity hardware so that failures are the norm rather than the exception [Verma et al (2015)]. Finally, cloud applications often rely on third-party services, as part of the application functionality, support (e.g., monitoring) or both.…”
Section: Cna: Definitions and Requirementsmentioning
confidence: 99%
“…Based on the lightweight architecture of container-based virtualization, containers offer several advantages over virtual machines such as high performance, resource efficiency, and agile environment. As a result of these advantages, containers have been adopted in the IT industry in areas such as cloud data centers, mobile systems, and networks [16][17][18].…”
Section: Background: Container-based Virtualizationmentioning
confidence: 99%
“…Choosing Mesos with Marathon [13] is an excellent combination if you already have a Mesos cluster. Kubernetes [14] is the third container-management system for automating deployment, scaling, and management of containerized applications influenced by Borg [9]. It aims to provide better ways of managing related, distributed components across varied infrastructure, while still benefiting from the improved utilization containers enable.…”
Section: Related Workmentioning
confidence: 99%
“…The development of containerization technology has greatly accelerated the popularization and application of deep learning. To put deep learning task into container and make the container as the unit of management can enable broader and easier deployment of deep learning algorithms for building cloud-based distributed deep learning platform [9]. In addition, it enables higher levels of utilization and can transition from training to serving smoothly.…”
Section: Introductionmentioning
confidence: 99%