1989
DOI: 10.1287/opre.37.4.626
Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs

Abstract: We deal with infinite state Markov decision processes with unbounded costs. Three simple conditions, based on the optimal discounted value function, guarantee the existence of an expected average cost optimal stationary policy. Sufficient conditions are the existence of a distinguished state of smallest discounted value and a single stationary policy inducing an irreducible, ergodic Markov chain for which the average cost of a first passage from any state to the distinguished state is finite. A result to verif…
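The three conditions referenced in the abstract are commonly stated in the literature in the following form. This is a sketch using standard notation (with $V_\alpha(x)$ the minimal expected total $\alpha$-discounted cost from state $x$ and $0$ a distinguished state); the paper's exact statement may differ:

```latex
% Sennott-type conditions (standard formulation; notation assumed, not quoted from the paper)
\begin{enumerate}
  \item $V_\alpha(x) < \infty$ for every state $x$ and every discount factor $\alpha \in (0,1)$.
  \item There exists $M \ge 0$ such that the relative discounted value
        $h_\alpha(x) := V_\alpha(x) - V_\alpha(0)$ satisfies $h_\alpha(x) \ge -M$
        for all $x$ and $\alpha$.
  \item There exists a nonnegative function $N(x)$ with $h_\alpha(x) \le N(x)$
        for all $x$ and $\alpha$, and for each $x$ some action $a$ with
        $\sum_{y} p(y \mid x, a)\, N(y) < \infty$.
\end{enumerate}
% Under such conditions, an expected average cost optimal stationary policy exists,
% and the optimal average cost is $g = \lim_{\alpha \uparrow 1} (1-\alpha) V_\alpha(x)$.
```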

Cited by 192 publications (177 citation statements); references 18 publications.
“…Moreover, as discussed above, the model is equivalent to a discrete time model by considering the system state at transition epochs. For the discrete model general results on average cost Markov decision problems (see, e.g., Sennott, 1989) assure the existence of a stationary average cost optimal policy. Average cost optimality of an (s, Q)-policy follows since any stationary policy based on the inventory position in the above model is equal to an (s, Q)-policy up to a transient phase.…”
Section: Proof
confidence: 99%
“…To this end, we exploit general theory of Markov decision processes that has been well developed in the past two decades. In particular, we make use of Sennott's results on infinite state Markov decision processes with unbounded costs (Sennott, 1989).…”
Section: General Demand Case: Optimal Policy Structure
confidence: 99%
“…Techniques for deriving the former from the latter are now well developed. Recent results specifically motivated by control of queues may be found in Borkar [8][9][10], Weber and Stidham [67], Cavazos-Cadena [12,13], Sennott [54,55]. For a survey, see Arapostathis et al [2].…”
Section: Introduction
confidence: 99%