Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290)
DOI: 10.1109/ijcnn.2002.1007791
|View full text |Cite
|
Sign up to set email alerts
|

Proposed framework for applying adaptive critics in real-time realm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 30 publications
0
12
0
Order By: Relevance
“…The control input can be calculated using Equation (34) once the critic network and system state density models become available. The implementation of this two stage optimisation method can be performed efficiently by utilising the modular approach constituting of functional modules and algorithmic modules (Ferrari and Stengel, 2004;Lendaris et al, 2002;Herzallah, 2007). The main functional modules are the action and critic networks.…”
Section: Generalised Nonlinear Probabilistic Control Algorithmmentioning
confidence: 99%
“…The control input can be calculated using Equation (34) once the critic network and system state density models become available. The implementation of this two stage optimisation method can be performed efficiently by utilising the modular approach constituting of functional modules and algorithmic modules (Ferrari and Stengel, 2004;Lendaris et al, 2002;Herzallah, 2007). The main functional modules are the action and critic networks.…”
Section: Generalised Nonlinear Probabilistic Control Algorithmmentioning
confidence: 99%
“…Similarly, the control law can be calculated from (35) once the critic network and system dynamic models become available. Although the probabilistic DHP adaptive critic approach involves multiple computation levels, its implementation can be made by means of modular approach constituting of functional modules and algorithmic modules [8], [20], [12]. The key functional modules are the action and critic networks.…”
Section: Nonlinear Randomized Control Algorithm Based On Probabilmentioning
confidence: 99%
“…The proof of convergence given in [16], [8] is directly applicable to the probabilistic adaptive critic design in this paper. A nonlinear control problem example to demonstrate the convergence of the proposed probabilistic critic network is given in Section V. Further discussion on the convergence and speed of convergence of adaptive critic designs can be found in [20]. Moreover, empirical evidence on the convergence of the adaptive critic design can be found in [3], [11], [19], [21].…”
Section: Nonlinear Randomized Control Algorithm Based On Probabilmentioning
confidence: 99%
“…In [81,185] a RFC for the NASA F-18/HARV based on a QFT compensator and an adaptive filter is used. Flight control based on reinforcement learning is the subject of [89,90,126]. Indirect adaptive control using a moving window/batch estimation for partial loss of the horizontal tail surface is studied in [157].…”
Section: Adaptive Controlmentioning
confidence: 99%