2016
DOI: 10.1007/s11134-016-9485-y
|View full text |Cite
|
Sign up to set email alerts
|

Queueing with redundant requests: exact analysis

Abstract: Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a request on multiple servers and wait for the first completion (discarding all remaining copies of the request). However, there is no exact analysis of systems with redundancy. This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any number of hete… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
106
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 114 publications
(113 citation statements)
references
References 37 publications
1
106
0
Order By: Relevance
“…This implies that each job is forked to the identical number of servers, and job is completed by joining identical number of service completions. Tight numerical bounds are provided in [6], analytical bounds are presented in [7], [15]- [17], analytical approximations appear in [18], exact analysis for small systems in [19], exact analysis for random independent scheduling for asymptotically large number of servers in [20], and an exact analysis of tail index for Pareto-distributed file sizes in [21].…”
Section: A Related Workmentioning
confidence: 99%
“…This implies that each job is forked to the identical number of servers, and job is completed by joining identical number of service completions. Tight numerical bounds are provided in [6], analytical bounds are presented in [7], [15]- [17], analytical approximations appear in [18], exact analysis for small systems in [19], exact analysis for random independent scheduling for asymptotically large number of servers in [20], and an exact analysis of tail index for Pareto-distributed file sizes in [21].…”
Section: A Related Workmentioning
confidence: 99%
“…The sojourn time for the 'N'-system under the Redundancy Service policy is derived in Theorems 2 and 3 of [2]. From this we obtain the expected sojourn times for type 1 and type 2 customers:…”
Section: A Comparison Of Fcfs-alis and Redundancy Service For The 'N'mentioning
confidence: 99%
“…The customer and all its copies leave the system when the first of its copies completes service. This model was studied by Gardner et al [2]. -A Parallel FCFS Matching Queue: There is an arrival stream of customers of types C, and an independent arrival stream of servers of types S. When a customer arrives he joins a queue of customers waiting for service.…”
Section: Introductionmentioning
confidence: 99%
“…We omit the details due to space considerations; for the full proof, see the associated technical report [15].…”
Section: Proofs For N Modelmentioning
confidence: 99%