Mitigating the Performance-Efficiency Tradeoff in Resilient Memory Disaggregation

Lee, Youngmoon; Maruf, Hasan Al; Chowdhury, Mosharaf; Cidon, Asaf; Shin, Kang G.

doi:10.48550/arxiv.1910.09727

Cited by 2 publications

(5 citation statements)

References 22 publications


“…DDA and SR-IOV require the whole address range of a VM to be statically pinned in hypervisor-level page tables [15, 75-79]. As a consequence, memory disaggregation cannot rely on hypervisor page faults, which would be needed to deploy existing RDMA-based systems at the platform level [29-37]. For the same reason, we cannot migrate pages between local memory and a pool as used in existing two-tier memory systems [31, 49-52].…”

Section: Design Goals and Requirements (mentioning)

confidence: 99%

“…ThymesisFlow advocates application changes for performance, while we focus on platform-level ML-driven pool memory management that is transparent to users. Hypervisor/OS level disaggregation: Hypervisor/OS level approaches [11, 28-37] rely on page faults and access monitoring to maintain the working set in local DRAM. Such OS-based approaches bring significant overhead, jitter, and are incompatible with virtualization acceleration (e.g., DDA).…”

Section: Related Work (mentioning)

confidence: 99%

“…Due to these requirements, most of the prior memory disaggregation work does not apply: custom hardware-based designs [13, 17-20], systems that require changes to the VM guest [11, 21-37], and implementations that rely on page faults [15] are not deployable in the cloud today (see §4.1).…”

Section: Introduction (mentioning)

confidence: 99%


First-generation Memory Disaggregation for Cloud Platforms

Li, Berger, Novaković, et al. 2022. Preprint.


In Azure, up to 25% of memory is stranded, i.e., it is leftover after the servers' cores have been rented to VMs. Memory disaggregation promises to reduce this stranding. However, making disaggregation practical for production cloud deployment remains challenging. For example, RDMA-based disaggregation involves too much overhead for common workloads and transparent latency management is incompatible with virtualization acceleration. The emerging Compute Express Link (CXL) standard offers a low-overhead substrate to build memory disaggregation while overcoming these challenges. This paper proposes a first-generation CXL-based disaggregation system that meets the requirements of cloud providers. Our system includes a memory pool controller, and prediction-based system software and distributed control plane designs. Its predictions of VM latency sensitivity and memory usage allow it to split workloads across local and pooled memory while mitigating the higher pool latency. Our analysis of production clusters shows that small pools of 8-32 sockets are sufficient to reduce stranding significantly. It also shows that ∼50% of all VMs never touch 50% of their rented memory. In emulated experiments with 150+ workloads, we show our pooling approach incurs a configurable performance loss between 1-5%. Finally, we show that disaggregation can achieve a 9-10% reduction in overall DRAM, which represents hundreds of millions of dollars in cost savings for a large cloud provider.


“…In our prototype, we do not assume that we are given a reliable disaggregated memory [54, 91], but rather show how to implement a reliable disaggregated memory using RDMA. To do so, we assume $2f_m + 1$ memory nodes out of which $f_m$ can fail.…”

Section: Model (mentioning)

confidence: 99%

“…Carbink [91] and Hydra [54] build reliable disaggregated memory to improve memory utilization in a cluster, albeit without support for concurrent shared access. MIND [53], GAM [21] and Clover [85], on the other hand, provide reliable shared memory, but they do not tolerate Byzantine writers.…”

Section: Related Work (mentioning)

confidence: 99%

uBFT: Microsecond-scale BFT using Disaggregated Memory

Aguilera, Ben-David, Guerraoui, et al. 2022. Preprint.


We propose uBFT, the first State Machine Replication (SMR) system to achieve microsecond-scale latency in data centers, while using only $2f + 1$ replicas to tolerate $f$ Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential as pure crashes appear to be a mere illusion with real-life systems reportedly failing in many unexpected ways. uBFT relies on a small non-tailored trusted computing base (disaggregated memory) and consumes a practically bounded amount of memory. uBFT is based on a novel abstraction called Consistent Tail Broadcast, which we use to prevent equivocation while bounding memory. We implement uBFT using RDMA-based disaggregated memory and obtain an end-to-end latency of as little as 10µs. This is at least 50× faster than MinBFT, a state-of-the-art $2f + 1$ BFT SMR based on Intel's SGX. We use uBFT to replicate two KV-stores (Memcached and Redis), as well as a financial order matching engine (Liquibook). These applications have low latency (up to 20µs) and become Byzantine tolerant with as little as 10µs more. The price for uBFT is a small amount of reliable disaggregated memory (less than 1 MiB), which in our prototype consists of a small number of memory servers connected through RDMA and replicated for fault tolerance.
