2020
DOI: 10.48550/arxiv.2008.08830
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Optimal Load Balancing in Bipartite Graphs

Wentao Weng,
Xingyu Zhou,
R. Srikant

Abstract: Applications in cloud platforms motivate the study of efficient load balancing under job-server constraints and server heterogeneity. In this paper, we study load balancing on a bipartite graph where left nodes correspond to job types and right nodes correspond to servers, with each edge indicating that a job type can be served by a server. Thus edges represent locality constraints, i.e., each job can only be served at servers which contained certain data and/or machine learning (ML) models. Servers in this sy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(13 citation statements)
references
References 45 publications
0
13
0
Order By: Relevance
“…The main difficulty is that their expressions involve fractions of two sums that both span all states. We would also like to generalize these results in other directions, for instance by considering open variants of the algorithm or by accounting for assignment constraints [7,21].…”
Section: Discussionmentioning
confidence: 95%
See 1 more Smart Citation
“…The main difficulty is that their expressions involve fractions of two sums that both span all states. We would also like to generalize these results in other directions, for instance by considering open variants of the algorithm or by accounting for assignment constraints [7,21].…”
Section: Discussionmentioning
confidence: 95%
“…To achieve good performance without these strong assumptions, more recent works introduced speed-aware variants of the above-mentioned well-known algorithms. More specifically, [21] introduced variants of join-the-shortest-queue and join-idle-queue where the server speeds are used as a tie-breaking rule, and proved that these variants minimize the mean response time in the many-server regime; [10] proposed variants of power-of-d-choices and join-idle-queue for service systems with two server types (fast and slow) by adapting the degree of diversity and assignment probabilities to the server speeds, and proved stability, again in the many-server regime. Despite these advances, we still lack a fundamental understanding of the impact of heterogeneity on performance in service systems with a finite number of servers.…”
Section: Introductionmentioning
confidence: 99%
“…RELATED WORKS Online load balancing policies are fully investigated under classic settings, where multiple identical servers with exponentially distributed service rates process continuous arrived jobs. Based on CTMC and Lyapunov Stability theories, load balancing policies such as JSQ [14], JIQ [11], Pod [12], and JFIQ [9] are proposed and analyzed on the mean response time and cross-server communication overhead. In a most recent work [9], Weng et al proposed the JFSQ and JFIQ policies under the constraints of heterogenous service rates and service locality.…”
Section: B Simulation Resultsmentioning
confidence: 99%
“…Based on CTMC and Lyapunov Stability theories, load balancing policies such as JSQ [14], JIQ [11], Pod [12], and JFIQ [9] are proposed and analyzed on the mean response time and cross-server communication overhead. In a most recent work [9], Weng et al proposed the JFSQ and JFIQ policies under the constraints of heterogenous service rates and service locality. They show that, under a well-connected bipartite graph condition, these two policies can achieve the minimum mean response time in both the many-server regime and the sub Halfin-Whitt regime.…”
Section: B Simulation Resultsmentioning
confidence: 99%
See 1 more Smart Citation