1991
DOI: 10.1017/s0269964800002230
|View full text |Cite
|
Sign up to set email alerts
|

Constrained Discounted Markov Decision Chains

Abstract: A Markov decision chain with countable state space incurs two types of costs: an operating cost and a holding cost. The objective is to minimize the expected discounted operating cost, subject to a constraint on the expected discounted holding cost. The existence of an optimal randomized simple policy is proved. This is a policy that randomizes between two stationary policies, that differ in at most one state. Several examples from the control of discrete time queueing systems are discussed.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

2
24
0

Year Published

1993
1993
2020
2020

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 39 publications
(26 citation statements)
references
References 10 publications
2
24
0
Order By: Relevance
“…To describe the solution of Problem 2, we first define Bernoulli randomized strategy and Bernoulli randomized simple strategy [36].…”
Section: ) Results For Constrained Communicationmentioning
confidence: 99%
“…To describe the solution of Problem 2, we first define Bernoulli randomized strategy and Bernoulli randomized simple strategy [36].…”
Section: ) Results For Constrained Communicationmentioning
confidence: 99%
“…Such problems frequently arise in computer networks and data communications, see Lazar [18], Spieksma and Hordijk [14], Nain and Ross [19] and Altman and Shwartz [1]. The theory for solving constrained MDPs was developed by Hordijk and Kallenberg [16], Kallenberg [17], Beutler and Ross [8], Altman and Shwartz [2,4], Altman [6], Spieksma [22], Sennott [20,21] and Borkar [9].…”
Section: Introductionmentioning
confidence: 99%
“…The case of finite, denumerable, and compact state spaces has been widely dealed (Beutler and Ross 1985;Sennott 1993;Borkar 1994;Kurano et al 2000a;Piunovskiy 1993Piunovskiy , 1997Piunovskiy and Khametov 1991;Tanaka 1991;Hu and Yue 2008). The discounted performance criteria has also already been dealed (Feimberg and Shwartz 1996;González-Hernández and Hernández-Lerma 2005;Hernández-Lerma and González-Hernández 2000;and Sennott 1991). We can see other related aspects (Collins and McNamara 1998;Kurano et al 2000b;and Yushkevich 1997).…”
Section: Introductionmentioning
confidence: 94%
“…It is already known that for Markov decision constraint problems, there exist optimal randomized policies (Beutler and Ross 1985;Borkar 1994;Frid 1972;González-Hernández and Hernández-Lerma 2005;Haviv 1996;Sennott 1991Sennott , 1993. The case of finite, denumerable, and compact state spaces has been widely dealed (Beutler and Ross 1985;Sennott 1993;Borkar 1994;Kurano et al 2000a;Piunovskiy 1993Piunovskiy , 1997Piunovskiy and Khametov 1991;Tanaka 1991;Hu and Yue 2008).…”
Section: Introductionmentioning
confidence: 99%