“…Moreover, as discussed above, the model is equivalent to a discrete time model by considering the system state at transition epochs. For the discrete model general results on average cost Markov decision problems (see, e.g., Sennott, 1989) assure the existence of a stationary average cost optimal policy. Average cost optimality of an (s, Q)-policy follows since any stationary policy based on the inventory position in the above model is equal to an (s, Q)-policy up to a transient phase.…”