2013
DOI: 10.1016/j.neunet.2013.01.014
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic DHP adaptive critic for nonlinear stochastic control systems

Abstract: Following the recently developed algorithms for fully probabilistic control design for general dynamic stochastic systems [15], [18], this paper presents the solution to the probabilistic dual heuristic programming (DHP) adaptive critic method [15] and randomized control algorithm for stochastic nonlinear dynamical systems. The purpose of the randomized control input design is to make the joint probability density function of the closed loop system as close as possible to a predetermined ideal joint probabilit… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
1

Relationship

4
3

Authors

Journals

citations
Cited by 22 publications
(27 citation statements)
references
References 28 publications
0
27
0
Order By: Relevance
“…Based on the Fully Probabilistic Design (FPD) [13]- [15], the control law c * (u k−1 |x k−1 ) which minimises the performance index (14) takes the following form,…”
Section: Optimal Controller Lawmentioning
confidence: 99%
See 2 more Smart Citations
“…Based on the Fully Probabilistic Design (FPD) [13]- [15], the control law c * (u k−1 |x k−1 ) which minimises the performance index (14) takes the following form,…”
Section: Optimal Controller Lawmentioning
confidence: 99%
“…The quadratic distance has been used in many papers to characterise the distance between the actual pdf and the desire pdf [10]- [12]. As many distributions can be expressed by an exponential function, Kullback-Leibler divergence (KLD) can be considered as a convenient scale to characterise the distance between two distributions [13]- [15]. Therefore, Fully Probabilistic Design (FPD) ( [13]- [16]), inspired by Bayesian approach to adaptive control design [17], which is proposed based on KLD, is applied in this paper.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The minimisation of the cost-to-go function (3) with respect to control law, c(u t | x t−1 ) is shown in [8,9] to be given by…”
Section: Probabilistic Control Objectivementioning
confidence: 99%
“…The pdf of optimal controller, c * (u t |x t ), minimizing the cost-to-go function (6) is determined by the following backward recursion, see e.g. Herzallah & Kárný (2011);Herzallah (2013)…”
Section: Preliminariesmentioning
confidence: 99%