This note provides upper bounds on the number of operations required by value iteration to compute a nearly optimal policy for an infinite-horizon discounted Markov decision process with a finite number of states and actions. For a given discount factor, magnitude of the reward function, and desired closeness to optimality, these upper bounds are strongly polynomial in the number of state-action pairs, and one of the provided upper bounds is a non-decreasing function of the discount factor.
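To make the setting concrete, here is a minimal sketch of value iteration for a finite discounted MDP, using the standard sup-norm stopping rule that yields an epsilon-optimal greedy policy. The array shapes, function name, and the specific threshold epsilon(1 - gamma)/(2 gamma) are illustrative assumptions for this example, not the note's construction or bounds.

```python
import numpy as np

def value_iteration(P, r, gamma, epsilon):
    """Compute an epsilon-optimal policy for a finite discounted MDP (sketch).

    P: transition probabilities, shape (S, A, S); P[s, a, s'] = Pr(s' | s, a)
    r: rewards, shape (S, A)
    gamma: discount factor in [0, 1)
    epsilon: desired closeness to optimality
    """
    S, A = r.shape
    v = np.zeros(S)
    # Standard stopping rule: if successive iterates differ by less than
    # epsilon*(1-gamma)/(2*gamma) in sup norm, the greedy policy is epsilon-optimal.
    threshold = epsilon * (1.0 - gamma) / (2.0 * gamma) if gamma > 0 else np.inf
    while True:
        q = r + gamma * (P @ v)        # Q[s, a] = r(s, a) + gamma * sum_{s'} P[s, a, s'] v(s')
        v_new = q.max(axis=1)          # Bellman optimality update
        if np.max(np.abs(v_new - v)) < threshold:
            v = v_new
            break
        v = v_new
    policy = q.argmax(axis=1)          # greedy policy w.r.t. the final value estimate
    return policy, v
```

The number of iterations of this loop, for fixed gamma, reward magnitude, and epsilon, is what the note bounds polynomially in the number of state-action pairs.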
This note considers an average-cost Markov Decision Process (MDP) with finite state and action sets, satisfying the additional condition that there is a state to which the system jumps from any state and under any action with positive probability. The main result is that the policy iteration algorithm is strongly polynomial for such MDPs, which are often used to model replacement and maintenance problems.
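For illustration, the following is a rough sketch of Howard's policy iteration for an average-cost MDP in the unichain setting implied by the jump-state condition. The linear-system formulation of the evaluation step, the least-squares solver, the tie-breaking rule, and the reference-state normalization are assumptions made for this example, not details taken from the note.

```python
import numpy as np

def policy_iteration_average_cost(P, c, s0=0, max_iter=1000):
    """Howard's policy iteration for an average-cost MDP (unichain sketch).

    P: transition probabilities, shape (S, A, S)
    c: one-step costs, shape (S, A)
    s0: reference state whose bias is normalized to zero (e.g. the state
        reachable with positive probability from every state-action pair)
    """
    S, A = c.shape
    policy = np.zeros(S, dtype=int)
    for _ in range(max_iter):
        # Policy evaluation: solve g + h(s) = c(s, pi(s)) + sum_{s'} P[s, pi(s), s'] h(s'),
        # together with the normalization h(s0) = 0, as a linear system in (g, h).
        P_pi = P[np.arange(S), policy]          # (S, S) transition matrix under pi
        c_pi = c[np.arange(S), policy]          # (S,)  one-step costs under pi
        A_mat = np.zeros((S + 1, S + 1))
        A_mat[:S, 0] = 1.0                      # coefficient of the gain g
        A_mat[:S, 1:] = np.eye(S) - P_pi        # coefficients of the bias h
        A_mat[S, 1 + s0] = 1.0                  # normalization h(s0) = 0
        b = np.concatenate([c_pi, [0.0]])
        x, *_ = np.linalg.lstsq(A_mat, b, rcond=None)
        g, h = x[0], x[1:]
        # Policy improvement: greedy with respect to one-step cost plus bias.
        q = c + P @ h                            # (S, A)
        new_policy = q.argmin(axis=1)
        # Keep the current action on ties to avoid cycling between equally good policies.
        keep = np.isclose(q[np.arange(S), policy], q[np.arange(S), new_policy])
        new_policy[keep] = policy[keep]
        if np.array_equal(new_policy, policy):
            break
        policy = new_policy
    return policy, g, h
```

The strong-polynomiality result concerns the number of improvement steps of this loop under the stated jump-state condition; the sketch above only shows the algorithmic skeleton being analyzed.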