2020
DOI: 10.3390/math9010052
|View full text |Cite
|
Sign up to set email alerts
|

Fast Two-Stage Computation of an Index Policy for Multi-Armed Bandits with Setup Delays

Abstract: We consider the multi-armed bandit problem with penalties for switching that include setup delays and costs, extending the former results of the author for the special case with no switching delays. A priority index for projects with setup delays that characterizes, in part, optimal policies was introduced by Asawa and Teneketzis in 1996, yet without giving a means of computing it. We present a fast two-stage index computing method, which computes the continuation index (which applies when the project has been… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 54 publications
(108 reference statements)
0
1
0
Order By: Relevance
“…The latter has huge modeling power but is computationally intractable, and Whittle's index policy has proven effective in an ever-increasing variety of models for multifarious applications. Thus, e.g., to name a few, scheduling multi-class make-to-stock queues [9], scheduling multi-class queues with finite buffers [10], admission control and routing to parallel queues with reneging [11], obsolescence mitigation strategies [12], sensor scheduling and dynamic channel selection [13][14][15][16], group maintenance [17], multi-target tracking with Kalman filter dynamics [18,19], scheduling multi-armed bandits with switching costs [20] or switching delays [21], the dynamic prioritization of medical treatments or interventions [22,23], and resource allocation with varying requests and with resources shared by multiple requests [24].…”
Section: Introductionmentioning
confidence: 99%
“…The latter has huge modeling power but is computationally intractable, and Whittle's index policy has proven effective in an ever-increasing variety of models for multifarious applications. Thus, e.g., to name a few, scheduling multi-class make-to-stock queues [9], scheduling multi-class queues with finite buffers [10], admission control and routing to parallel queues with reneging [11], obsolescence mitigation strategies [12], sensor scheduling and dynamic channel selection [13][14][15][16], group maintenance [17], multi-target tracking with Kalman filter dynamics [18,19], scheduling multi-armed bandits with switching costs [20] or switching delays [21], the dynamic prioritization of medical treatments or interventions [22,23], and resource allocation with varying requests and with resources shared by multiple requests [24].…”
Section: Introductionmentioning
confidence: 99%