1978
DOI: 10.1007/bf01818535
|View full text |Cite
|
Sign up to set email alerts
|

Functional equations in dynamic programming

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
71
0
2

Year Published

1983
1983
2016
2016

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 172 publications
(74 citation statements)
references
References 6 publications
1
71
0
2
Order By: Relevance
“…It is well known that equation of the type (4.1) provides useful tools for mathematical optimization, computer and dynamic programming (see, [9,12]). Let B(W ) denote the space of all bounded real-valued functions defined on the set W , where B(W ) is endowed with the metric d(h, k) = sup x∈W |hx − kx| for all h, k ∈ B(W ).…”
Section: An Application In Dynamic Programmingmentioning
confidence: 99%
“…It is well known that equation of the type (4.1) provides useful tools for mathematical optimization, computer and dynamic programming (see, [9,12]). Let B(W ) denote the space of all bounded real-valued functions defined on the set W , where B(W ) is endowed with the metric d(h, k) = sup x∈W |hx − kx| for all h, k ∈ B(W ).…”
Section: An Application In Dynamic Programmingmentioning
confidence: 99%
“…The idea of using value function approximation goes back to the early days of dynamic programming (Samuel, 1959;Bellman and Dreyfus, 1959). With the recent growth of interest in reinforcement learning, work on value function approximation methods flourished (Bertsekas and Tsitsiklis, 1996;Sutton and Barto, 1998).…”
Section: Related Workmentioning
confidence: 99%
“…If suitable features and representative states are chosen, V(W(t)) may converge to a reasonable approximation of the optimal cost-to-go vector V*. Such an algorithm has been considered in the literature (Bellman (1959), Reetz (1977), Morin (1979)). Of these references, only (Reetz (1977)), establishes convergence and error bounds.…”
Section: Algorithmic Modelmentioning
confidence: 99%
“…Bellman and Dreyfus (1959) explored the use of polynomials as compact representations for accelerating dynamic programming. Whitt (1978) and Reetz (1977) analyzed approaches of reducing state space sizes, which lead to compact representations.…”
Section: Introductionmentioning
confidence: 99%