2009 International Joint Conference on Neural Networks 2009
DOI: 10.1109/ijcnn.2009.5178716

A retrospective on Adaptive Dynamic Programming for control

Abstract: Some three decades ago, certain computational intelligence methods of reinforcement learning were recognized as implementing an approximation of Bellman's Dynamic Programming method, which is known in the controls community as an important tool for designing optimal control policies for nonlinear plants and sequential decision making. Significant theoretical and practical developments have occurred within this arena, mostly in the past decade, with the methodology now usually referred to as Adaptive Dynamic Pr…

Cited by 32 publications
(13 citation statements)
References 43 publications
“…Several forms of actor-critic methods are available [33]. Heuristic Dynamic Programming (HDP) can be seen as the default actor-critic method, where the critic approximates the sum of discounted future rewards.…”
Section: Temporal-difference Learning: Actors Critics Actor-critics (mentioning)
confidence: 99%
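As a hedged illustration of the quantity named in the statement above: assuming a reward r_t received at step t, a state x_t, and a discount factor γ with 0 ≤ γ < 1 (these symbols do not appear in the quoted text and are introduced here only for illustration), the sum of discounted future rewards that the HDP critic approximates, and the Bellman-style recursion it satisfies along a fixed deterministic trajectory, can be written as:

```latex
J(x_t) = \sum_{k=0}^{\infty} \gamma^{k}\, r_{t+k},
\qquad
J(x_t) = r_t + \gamma\, J(x_{t+1}).
```

The second form is what makes learning practical: a critic can be trained to reduce the mismatch between its estimate of J(x_t) and r_t + γ J(x_{t+1}) without ever summing the full series.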
“…At present, a number of neural-network control methods have been developed, as described in [6][7][8][9][10][11][12][13][14][15][16][17][18][19][20]. These works clearly illustrate the effectiveness of applying neural-network control methods to nonlinear dynamic plants under complex operating conditions.…”
Section: Introduction (unclassified)
“…At present, a number of neural-network control methods have been developed, as described in [6][7][8][9][10][11][12][13][14][15][16][17][18][19][20]. Many examples of successfully operating neural-network control systems have been obtained: for an aircraft [21][22][23], a helicopter [24], an ore-beneficiation process [25], a robot car [26], a hybrid car engine [26], an electric furnace [6], a turbogenerator [12], a welding machine [16], a pneumatic cylinder [28], a special-purpose plant [29], an inverted pendulum model [30], and others.…”
Section: Introduction (unclassified)
“…Unfortunately, dynamic programming could not be used in many applications because of the 'curse of dimensionality', that is, computational complexity increases exponentially with the dimensionality of the application. To overcome this problem, methodologies known as neuro-dynamic programming [4], or adaptive critic design (ACD) [5], also known as adaptive dynamic programming [6], have been proposed. The core of these methods is the approximation of Bellman's equation or value function by learning, and also using neural networks.…”
Section: Introduction (mentioning)
confidence: 99%
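The two ideas in the statement above, exponential growth of a tabulated state space and learning an approximate value function from Bellman's equation, can be sketched on a toy problem. This is a minimal sketch under stated assumptions, not the method of any cited paper: the 5-state chain, the names `table_size`, `step`, and `train_critic`, and the values of `GAMMA` and `ALPHA` are all hypothetical, and a linear critic with one weight per state stands in for the neural networks the statement mentions.

```python
# Hypothetical toy problem (not from the cited papers): a 5-state chain.
GAMMA = 0.9      # discount factor (assumed value)
ALPHA = 0.1      # learning rate (assumed value)
N_STATES = 5     # non-terminal states 0..4; leaving state 4 ends the episode


def table_size(bins_per_dim: int, dims: int) -> int:
    """Cells in a uniform grid over the state space: exponential in dims."""
    return bins_per_dim ** dims


def step(state: int):
    """Deterministic chain: move right; reward 1.0 only on the final transition."""
    next_state = state + 1
    done = next_state == N_STATES
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train_critic(episodes: int = 500):
    """TD(0) on one weight per state (a linear critic with one-hot features)."""
    w = [0.0] * N_STATES
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            next_state, reward, done = step(state)
            # Bellman target: immediate reward plus discounted next-state estimate.
            target = reward + (0.0 if done else GAMMA * w[next_state])
            # Move the estimate a fraction ALPHA toward the target.
            w[state] += ALPHA * (target - w[state])
            state = next_state
    return w


critic = train_critic()
# Exact values for this chain: GAMMA raised to the number of steps before the reward.
exact = [GAMMA ** (N_STATES - 1 - s) for s in range(N_STATES)]
print([round(v, 4) for v in critic])
print(table_size(10, 2), table_size(10, 6))
```

With 10 bins per dimension, a 2-dimensional grid has 100 cells while a 6-dimensional one has 1,000,000, which is the growth that makes exact tabulation impractical. In this toy chain the learned weights approach the exact values `GAMMA ** (4 - s)`; with a real plant, a parametric approximator would replace the per-state table.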