2020 IEEE International Symposium on High Performance Computer Architecture (HPCA) 2020
DOI: 10.1109/hpca47549.2020.00026
|View full text |Cite
|
Sign up to set email alerts
|

Q-Zilla: A Scheduling Framework and Core Microarchitecture for Tail-Tolerant Microservices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 19 publications
(4 citation statements)
references
References 58 publications
0
4
0
Order By: Relevance
“…The scheduler, however, is limited to improving the latency of networking tasks only. Q‐Zilla 38 is a scheduling framework aimed at reducing queuing delays and thereby reducing tail latency. The queuing algorithm has also been implemented at the single‐node level to create a micro‐architecture (CoreZilla) that improves CPU access speeds.…”
Section: Related Workmentioning
confidence: 99%
“…The scheduler, however, is limited to improving the latency of networking tasks only. Q‐Zilla 38 is a scheduling framework aimed at reducing queuing delays and thereby reducing tail latency. The queuing algorithm has also been implemented at the single‐node level to create a micro‐architecture (CoreZilla) that improves CPU access speeds.…”
Section: Related Workmentioning
confidence: 99%
“…In this work, they consider the energy consumption of all computations and transmissions, ignoring the scarcity differences among various devices/servers, such as, the transmission energy of sensing devices is much scarcer than the computation energy of cloud servers. In addition, the optimization of accumulated sum may lead to an imbalance performance among tasks, such as the long tail latency [133].…”
Section: D: Multi-objective Optimizationmentioning
confidence: 99%
“…Shinjuku [24] seeks to address this challenge via implementing a highly efficient preemption mechanism to enable processor sharing by eliminating the operating system threading overheads. RPCValet [12], Nebula [43], and Q-Zilla [35,36] make the observation that shared request queues are very costly for s-scale microservices despite being imperative for achieving minimal tail latency. They seek to enable shared queues through specialized hardware support.…”
Section: Related Workmentioning
confidence: 99%