Cache-aware load balancing of data center applications

Archer, Aaron; Aydin, Kevin; Bateni, MohammadHossein; Mirrokni, Vahab; Schild, Aaron; Yang, Ray Yeng; Zhuang, Richard

doi:10.14778/3311880.3311887

Cited by 18 publications

(5 citation statements)

References 67 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A similar application is to boost IO throughput in the Google search engine backend by improving cache utilization. Archer et al [15] initialize a voting table that assigns search requests, using graph partitioning on a bipartite graph where vertices are search terms and queries, and an edge exists for each term contained in a query. Subsequent simulation-and-refinement further boosts the prediction accuracy (item in cache) of the voting table.…”

Section: Applicationsmentioning

confidence: 99%

More Recent Advances in (Hyper)Graph Partitioning

Çatalyürek¹,

Devine²,

Faraj³

et al. 2022

Preprint

View full text Add to dashboard Cite

In recent years, significant advances have been made in the design and evaluation of balanced (hyper)graph partitioning algorithms. We survey trends of the last decade in practical algorithms for balanced (hyper)graph partitioning together with future research directions. Our work serves as an update to a previous survey on the topic [34]. In particular, the survey extends the previous survey by also covering hypergraph partitioning and streaming algorithms, and has an additional focus on parallel algorithms.

show abstract

Section: Applicationsmentioning

confidence: 99%

More Recent Advances in (Hyper)Graph Partitioning

Çatalyürek¹,

Devine²,

Faraj³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…In this work, we consider the connectivity metric e∈E (λ(e) − 1)ω(e) where λ(e) := {V i | e ∩ V i = ∅} denotes the number of different blocks connected by hyperedge e ∈ E and ω(e) denotes its weight. Often balanced partitioning is used as an acceleration technique for other applications, such as quantum circuit simulation [31], sharding distributed databases [15,36], load balancing (for scientific computing) [13], route planning [17,32], or boosting cache utilization in a search engine backend [7].…”

Section: Introductionmentioning

confidence: 99%

Parallel Flow-Based Hypergraph Partitioning

Gottesbüren¹,

Heuer²,

Sanders³

2022

Preprint

View full text Add to dashboard Cite

We present a shared-memory parallelization of flow-based refinement, which is considered the most powerful iterative improvement technique for hypergraph partitioning at the moment. Flow-based refinement works on bipartitions, so current sequential partitioners schedule it on different block pairs to improve k-way partitions. We investigate two different sources of parallelism: a parallel scheduling scheme and a parallel maximum flow algorithm based on the well-known push-relabel algorithm. In addition to thoroughly engineered implementations, we propose several optimizations that substantially accelerate the algorithm in practice, enabling the use on extremely large hypergraphs (up to 1 billion pins). We integrate our approach in the state-of-the-art parallel multilevel framework Mt-KaHyPar and conduct extensive experiments on a benchmark set of more than 500 real-world hypergraphs, to show that the partition quality of our code is on par with the highest quality sequential code (KaHyPar), while being an order of magnitude faster with 10 threads.

show abstract

“…Caching is a fundamental technique for boosting systems performance. In particular, software-managed caches, aka software caches, are employed in multiple data-stores and databases [2,12,15,19,20,21,36,40,42,41], operating systems, middleware, streaming services, and is a major capability of edge computing. The common motivation behind caching is to store data closer to the application than its source and avoid recalculating queries, query plans, and temporal indices.…”

Section: Introductionmentioning

confidence: 99%

Limited Associativity Caching in the Data Plane

Friedman¹,

Goaz²,

Hovav³

2022

Preprint

View full text Add to dashboard Cite

In-network caching promises to improve the performance of networked and edge applications as it shortens the paths data need to travel. This is by storing so-called hot items in the network switches on-route between clients who access the data and the storage servers who maintain it. Since the data flows through those switches in any case, it is natural to cache hot items there.Most software-managed caches treat the cache as a fully associative region. Alas, a fully associative design seems to be at odds with programmable switches' goal of handling packets in a short bounded amount of time, as well as their restricted programming model. In this work, we present PKache, a generic limited associativity cache implementation in the programmable switches' domain-specific P4 language, and demonstrate its utility by realizing multiple popular cache management schemes.

show abstract

Cache-aware load balancing of data center applications

Cited by 18 publications

References 67 publications

More Recent Advances in (Hyper)Graph Partitioning

More Recent Advances in (Hyper)Graph Partitioning

Parallel Flow-Based Hypergraph Partitioning

Limited Associativity Caching in the Data Plane

Contact Info

Product

Resources

About