Abstract:A multiclass queueing system is considered, with heterogeneous service stations, each consisting of many servers with identical capabilities. An optimal control problem is formulated, where the control corresponds to scheduling and routing, and the cost is a cumulative discounted functional of the system's state. We examine two versions of the problem: "nonpreemptive," where service is uninterruptible, and "preemptive," where service to a customer can be interrupted and then resumed, possibly at a different st… Show more
“…Atar et al [6] proved that the policies obtained from this approach are asymptotically optimal. Atar [4,5] followed a similar approach to that in [6] to find asymptotically optimal policies for tree-like systems. In Mandelbaum et al [33], they used a uniform acceleration technique to obtain the fluid and diffusion limit of a Markovian service network.…”
Abstract:We consider a class of queueing systems that consist of server pools in parallel and multiple customer classes. Customer service times are assumed to be exponentially distributed. We study the asymptotic behavior of these queueing systems in a heavy traffic regime that is known as the Halfin and Whitt many-server asymptotic regime. Our main contribution is a general framework for establishing state space collapse results in this regime for parallel server systems. In our work, state space collapse refers to a decrease in the dimension of the processes tracking the number of customers in each class waiting for service and the number of customers in each class being served by various server pools. We define and introduce a "state space collapse" function, which governs the exact details of the state space collapse. We show that a state space collapse result holds in many-server heavy traffic if a corresponding deterministic hydrodynamic model satisfies a similar state space collapse condition. Our methodology is similar in spirit to that in Bramson [10], which focuses on the conventional heavy traffic regime. We illustrate the applications of our results by establishing state space collapse results in many-server diffusion limits of static-buffer-priority V-parallel server systems, N-model parallel server systems, and minimum-expecteddelay-faster-server-first distributed server pools systems. We show for these systems that the condition on the hydrodynamic model can easily be checked using the standard tools for fluid models.
“…Atar et al [6] proved that the policies obtained from this approach are asymptotically optimal. Atar [4,5] followed a similar approach to that in [6] to find asymptotically optimal policies for tree-like systems. In Mandelbaum et al [33], they used a uniform acceleration technique to obtain the fluid and diffusion limit of a Markovian service network.…”
Abstract:We consider a class of queueing systems that consist of server pools in parallel and multiple customer classes. Customer service times are assumed to be exponentially distributed. We study the asymptotic behavior of these queueing systems in a heavy traffic regime that is known as the Halfin and Whitt many-server asymptotic regime. Our main contribution is a general framework for establishing state space collapse results in this regime for parallel server systems. In our work, state space collapse refers to a decrease in the dimension of the processes tracking the number of customers in each class waiting for service and the number of customers in each class being served by various server pools. We define and introduce a "state space collapse" function, which governs the exact details of the state space collapse. We show that a state space collapse result holds in many-server heavy traffic if a corresponding deterministic hydrodynamic model satisfies a similar state space collapse condition. Our methodology is similar in spirit to that in Bramson [10], which focuses on the conventional heavy traffic regime. We illustrate the applications of our results by establishing state space collapse results in many-server diffusion limits of static-buffer-priority V-parallel server systems, N-model parallel server systems, and minimum-expecteddelay-faster-server-first distributed server pools systems. We show for these systems that the condition on the hydrodynamic model can easily be checked using the standard tools for fluid models.
“…Our initial investigation suggests that the insights gained from studying the V-model are useful in analyzing more complicated network structures (Gurvich 2004). Other researchers have also tackled this general skill-based routing (SBR) problem (e.g., Bassamboo et al 2006aBassamboo et al , 2006bAtar 2005;Tezcan and Dai 2007;Gurvich and Whitt 2006), but many issues remain unresolved.…”
W e study large-scale service systems with multiple customer classes and many statistically identical servers.The following question is addressed: How many servers are required (staffing) and how does one match them with customers (control) to minimize staffing cost, subject to class-level quality-of-service constraints? We tackle this question by characterizing scheduling and staffing schemes that are asymptotically optimal in the limit, as system load grows to infinity. The asymptotic regimes considered are consistent with the efficiencydriven (ED), quality-driven (QD), and quality-and-efficiency-driven (QED) regimes, first introduced in the context of a single-class service system.Our main findings are as follows: (a) Decoupling of staffing and control, namely, (i) staffing disregards the multiclass nature of the system and is analogous to the staffing of a single-class system with the same aggregate demand and a single global quality-of-service constraint, and (ii) class-level service differentiation is obtained by using a simple idle-server-based threshold-priority (ITP) control (with state-independent thresholds); and (b) robustness of the staffing and control rules: our proposed single-class staffing (SCS) rule and ITP control are approximately optimal under various problem formulations and model assumptions. Particularly, although our solution is shown to be asymptotically optimal for large systems, we numerically demonstrate that it performs well also for relatively small systems.
“…Papers [24] and [7] formulated a diffusion control problem to study a V-parallel server system with impatient customers in many-server heavy traffic; [7] proved that the policies obtained from this approach are asymptotically optimal. Works [5,6] followed a similar approach to that in [7] to find asymptotically optimal policies for tree-like systems. In [38] and [15] it is shown that a greedy policy is asymptotically optimal first for N-systems and then for general systems when the service rates are only server pool dependent.…”
Section: Review Of Related Previous Workmentioning
confidence: 99%
“…To improve exposition, from this point on in the proof, consider a special system with I = {1, 2, 3, 4}, J = {5, 6, 7, 8}, and basic activities E b = {(1, 5), (2,5), (2,6), (2,7), (3,7), (3,8), (4,5)}, see Fig. 1.…”
Section: Proof Of Theorem 31mentioning
confidence: 99%
“…The service rates are μ 15 = 4, μ 16 = 5, μ 25 = 2, μ 26 = 4, μ 27 = 1, μ 36 = 2, μ 37 = 1, μ 38 = 1, μ 35 = 1.5, μ 45 = 4, μ 48 = 1.5 and all other μ ij 's are zero (for non-feasible activities); β 5 = β 6 = β 7 = β 8 = 1. The input rate parameters are λ 1 = 2, λ 2 = 4.8, λ 3 = 1.6, and λ 4 = 1.2, which makes all feasible activities basic, except activities (3,5), (3,6), (1,6), and (4,8). We use the scaling parameter r = 100, so that there are 100 servers in each pool.…”
Section: Performance Of Shadow Routing Algorithm In a 4 × 4 Systemmentioning
A general model with multiple input flows (classes) and several flexible multi-server pools is considered. We propose a robust, generic scheme for routing new arrivals, which optimally balances server pools' loads, without the knowledge of the flow input rates and without solving any optimization problem. The scheme is based on Shadow routing in a virtual queueing system. We study the behavior of our scheme in the Halfin-Whitt (or, QED) asymptotic regime, when server pool sizes and the input rates are scaled up simultaneously by a factor r growing to infinity, while keeping the system load within O( (ii) We show that some natural algorithms, such as MaxWeight, that guarantee stability, are not order-optimal. (iii) Under the complete resource pooling condition, we prove the diffusion limit of the arrival processes into server pools, under the Shadow routing. (We conjecture that result (iii) leads to order-optimality of the Shadow routing algorithm; a formal proof of this fact is an important subject of future work.) Simulation results demonstrate good performance and robustness of our scheme.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.