2014
DOI: 10.1002/rnc.3152

Approximate dynamic programming via iterated Bellman inequalities

Abstract: In this paper we introduce new methods for finding functions that lower bound the value function of a stochastic control problem, using an iterated form of the Bellman inequality. Our method is based on solving linear or semidefinite programs, and produces both a bound on the optimal objective and a suboptimal policy that appears to work very well. These results extend and improve bounds obtained in a previous paper using a single Bellman inequality condition. We describe the methods in a genera…
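The abstract describes lower-bounding the value function by imposing the Bellman inequality, iterated K times, on a parametric family of candidate functions and solving the resulting convex program. Below is a minimal, hypothetical sketch of that idea (not the authors' code) for the special case of a deterministic, discounted LQR problem with quadratic candidates V_i(x) = xᵀP_i x, written with CVXPY; the dimensions, problem data, and variable names are illustrative assumptions only.

```python
# Sketch: iterated Bellman inequality as a chain of LMIs for discounted LQR.
# Each condition V_i <= T V_{i+1} (with quadratic V_i) is a linear matrix
# inequality in (P_i, P_{i+1}); maximizing V_0(x0) over the chain yields a
# lower bound on the optimal cost from x0.  Requires cvxpy and numpy.
import cvxpy as cp
import numpy as np

np.random.seed(0)
n, m = 4, 2          # state and input dimensions (arbitrary example sizes)
gamma = 0.95         # discount factor
K = 3                # number of iterated Bellman inequalities

# Synthetic example dynamics x_{t+1} = A x_t + B u_t and quadratic stage cost
# x'Qx + u'Ru (data chosen only for illustration).
A = 0.95 * np.random.randn(n, n) / np.sqrt(n)
B = np.random.randn(n, m)
Q = np.eye(n)
R = np.eye(m)
x0 = np.ones(n)

# Quadratic candidates V_i(x) = x' P_i x; tying P_K back to P_0 closes the
# chain, so feasibility implies V_0 <= T^K V_0 and hence V_0 <= V*.
P = [cp.Variable((n, n), symmetric=True) for _ in range(K)]
P.append(P[0])

constraints = []
for i in range(K):
    # V_i(x) <= x'Qx + u'Ru + gamma * V_{i+1}(Ax + Bu) for all (x, u)
    # is equivalent to the block matrix below being PSD; it is linear in the
    # variables P_i, P_{i+1} and symmetric by construction.
    M = cp.bmat([
        [Q + gamma * A.T @ P[i + 1] @ A - P[i], gamma * A.T @ P[i + 1] @ B],
        [gamma * B.T @ P[i + 1] @ A,            R + gamma * B.T @ P[i + 1] @ B],
    ])
    constraints.append(M >> 0)

# Best lower bound on the optimal cost starting from x0 within this family.
prob = cp.Problem(cp.Maximize(x0 @ P[0] @ x0), constraints)
prob.solve()
print("lower bound on V*(x0):", prob.value)
```

For an unconstrained LQR instance the true value function is itself quadratic, so a single inequality (K = 1) already gives a tight bound; the iterated chain is intended to help when the true value function lies outside the chosen candidate family (for example, under input constraints), which is the sense in which the abstract says these results improve on the single-inequality bounds.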

Cited by 70 publications (116 citation statements)
References 66 publications (88 reference statements)
“…This value-function-based method is called DP, and a variety of topics on stochastic optimal control and DP are well-addressed by [5][6][7][8]. A large class of stochastic optimal control problems deal with the dynamics of the form in Equation (1) and are concerned with finding a state-feedback control policy:…”
Section: Preliminaries (mentioning)
Confidence: 99%
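The excerpt above is cut off at the policy definition. As a hypothetical completion (the citing paper's exact notation is not recoverable from the truncated text), the standard formulation in this literature seeks a state-feedback policy minimizing the expected discounted cost:

```latex
% Hypothetical standard form: a policy \phi : \mathcal{X} \to \mathcal{U},
% with u_t = \phi(x_t), minimizing the expected discounted cost subject to
% the stochastic dynamics x_{t+1} = f(x_t, u_t, w_t).
J(\phi) \;=\; \mathbf{E} \sum_{t=0}^{\infty} \gamma^{t}\,
\ell\big(x_t, \phi(x_t)\big),
\qquad x_{t+1} = f\big(x_t, \phi(x_t), w_t\big).
```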
“…where T is the Bellman operator (see e.g., [5]), whose domain and codomain are both function spaces mapping X onto…”
Section: Preliminaries (mentioning)
Confidence: 99%
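The statement above is truncated mid-definition; for reference, the discounted Bellman operator it refers to is conventionally defined as follows (standard form, assumed here rather than quoted from the citing paper):

```latex
% Standard discounted Bellman operator acting on a candidate value function V:
(\mathcal{T} V)(x) \;=\; \min_{u \in \mathcal{U}}
\Big( \ell(x, u) + \gamma\, \mathbf{E}\, V\big(f(x, u, w)\big) \Big),
\qquad x \in \mathcal{X}.
```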