Tales of the Tail

Li, Jialin; Sharma, Naveen; Ports, Dan R. K.; Gribble, Steven D.

doi:10.1145/2670979.2670988

Cited by 164 publications

(18 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The statemachine configuration for Memcached and Web-Search are represented in blue and red line, respectively, in Figure 2c. Figure 3 [30] and a QoS target that needs to be met. As shown in Figure 2 and 3, there exists a unique configuration for each load that optimizes energy efficiency.…”

Section: Motivationmentioning

confidence: 99%

Hipster: Hybrid Task Manager for Latency-Critical Cloud Workloads

Nishtala

Carpenter

Petrucci

et al. 2017

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

View full text Add to dashboard Cite

In 2013, U.S. data centers accounted for 2.2% of the country's total electricity consumption, a figure that is projected to increase rapidly over the next decade. Many important workloads are interactive, and they demand strict levels of quality-of-service (QoS) to meet user expectations, making it challenging to reduce power consumption due to increasing performance demands.This paper introduces Hipster, a technique that combines heuristics and reinforcement learning to manage latency-critical workloads. Hipster's goal is to improve resource efficiency in data centers while respecting the QoS of the latency-critical workloads. Hipster achieves its goal by exploring heterogeneous multicores and dynamic voltage and frequency scaling (DVFS). To improve data center utilization and make best usage of the available resources, Hipster can dynamically assign remaining cores to batch workloads without violating the QoS constraints for the latency-critical workloads. We perform experiments using a 64-bit ARM big.LITTLE platform, and show that, compared to prior work, Hipster improves the QoS guarantee for Web-Search from 80% to 96%, and for Memcached from 92% to 99%, while reducing the energy consumption by up to 18%.

show abstract

Section: Motivationmentioning

confidence: 99%

Hipster: Hybrid Task Manager for Latency-Critical Cloud Workloads

Nishtala

Carpenter

Petrucci

et al. 2017

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

View full text Add to dashboard Cite

show abstract

“…In practical scenarios, each workload has a time-varying load [38] and a QoS target that needs to be met. As shown in Figure 2 and 3, there exists a unique con guration for each load that optimizes energy e ciency.…”

Section: Exploring Individual Workload Particularitiesmentioning

confidence: 99%

“…At runtime, Hipster determines when to dynamically switch between the learning and exploitation phases, based on a pre xed time quantum. At deployment stage, we ensure that the bucket size for each workload gives at least 95 % QoS guarantee [38] with minimal energy consumption.…”

Section: Learning and Exploitation Phasesmentioning

confidence: 99%

The Hipster Approach for Improving Cloud System Efficiency

Nishtala

Carpenter

Petrucci

et al. 2017

ACM Trans. Comput. Syst.

View full text Add to dashboard Cite

In 2013, U.S. data centers accounted for 2.2% of the country's total electricity consumption, a gure that is projected to increase rapidly over the next decade. Many important data center workloads in cloud computing are interactive, and they demand strict levels of quality-of-service (QoS) to meet user expectations, making it challenging to optimize power consumption along with increasing performance demands. This paper introduces Hipster, a technique that combines heuristics and reinforcement learning to improve resource e ciency in cloud systems. Hipster explores heterogeneous multi-cores and dynamic voltage and frequency scaling (DVFS) for reducing energy consumption while managing the QoS of the latency-critical workloads. To improve data center utilization and make best usage of the available resources, Hipster can dynamically assign remaining cores to batch workloads without violating the QoS constraints for the latency-critical workloads. We perform experiments using a 64-bit ARM big.LITTLE platform, and show that, compared to prior work, Hipster improves the QoS guarantee for Web-Search from 80% to 96%, and for Memcached from 92% to 99%, while reducing the energy consumption by up to 18%. Hipster is also e ective in learning and adapting automatically to speci c requirements of new incoming workloads just enough to meet the QoS and optimize resource consumption. In this work, we extend our previous work in several ways. First, we present an analysis of the size of the reward lookup table and an optimization for the table to improve the scalability of our reinforcement learning mechanism. Second, we demonstrate Hipster's capability to adapt to changes in the latency-critical application at runtime and still satisfy QoS guarantees of the new incoming applications. Lastly, we present a deployment methodology for setting up new applications managed by Hipster's runtime system.

show abstract

“…Ideally, each service node will use the fewest resources (cores, memory, or IOPS) needed to satisfy packet rate and tail latency requirements at any point. Unfortunately, classic operating system schedulers are illmatched to ensure tail control [Leverich and Kozyrakis 2014;Li et al 2014]. Novel dynamic resource management mechanisms and policies are required to improve energy proportionality and workload consolidation in the presence of latency-sensitive applications [Lo et al 2014[Lo et al , 2015.…”

Section: Challenges For Datacenter Applicationsmentioning

confidence: 99%

“…They are particularly sensitive to resource allocation and frequency settings, and they suffer frequent tail latency violations when common power management or consolidation approaches are used [Leverich and Kozyrakis 2014;Li et al 2014]. As a result, operators typically deploy them on dedicated servers running in polling mode, forgoing opportunities for workload consolidation and reduced power consumption at below-peak utilization levels.…”

Section: Introductionmentioning

confidence: 99%

The IX Operating System

Belay¹,

Prekas

Primorac

et al. 2016

ACM Trans. Comput. Syst.

View full text Add to dashboard Cite

The conventional wisdom is that aggressive networking requirements, such as high packet rates for small messages and μs-scale tail latency, are best addressed outside the kernel, in a user-level networking stack. We present IX, a dataplane operating system that provides high I/O performance and high resource efficiency while maintaining the protection and isolation benefits of existing kernels.IX uses hardware virtualization to separate management and scheduling functions of the kernel (control plane) from network processing (dataplane). The dataplane architecture builds upon a native, zero-copy API and optimizes for both bandwidth and latency by dedicating hardware threads and networking queues to dataplane instances, processing bounded batches of packets to completion, and eliminating coherence traffic and multicore synchronization. The control plane dynamically adjusts core allocations and voltage/frequency settings to meet service-level objectives.We demonstrate that IX outperforms Linux and a user-space network stack significantly in both throughput and end-to-end latency. Moreover, IX improves the throughput of a widely deployed, key-value store by up to 6.4× and reduces tail latency by more than 2×. With three varying load patterns, the control plane saves 46%-54% of processor energy, and it allows background jobs to run at 35%-47% of their standalone throughput.

show abstract

Tales of the Tail

Cited by 164 publications

References 18 publications

Hipster: Hybrid Task Manager for Latency-Critical Cloud Workloads

Hipster: Hybrid Task Manager for Latency-Critical Cloud Workloads

The Hipster Approach for Improving Cloud System Efficiency

The IX Operating System

Contact Info

Product

Resources

About