Probabilistic DHP adaptive critic for nonlinear stochastic control systems

2019 IEEE 15th International Conference on Control and Automation (ICCA)

Zafar

2019

Self Cite

In this paper, a novel algorithm based on fully probabilistic design (FPD) is proposed for a class of linear stochastic dynamic processes with multiplicative noise. Compared with the traditional FPD, the new procedure is presented to deal with multiplicative noise and the system parameters are estimated online by the linear optimisation. The performance index is characterised by the Kullback-Leibler divergence (KLD). The generalised probabilistic control law is obtained by solving the Riccatti equation while taking the multiplicative noise into consideration. To demonstrate the effectiveness of the proposed method, a numerical example is given in comparison with the traditional FPD.

“…Based on the Fully Probabilistic Design (FPD) [13]- [15], the control law c * (u k−1 |x k−1 ) which minimises the performance index (14) takes the following form,…”

Section: Optimal Controller Lawmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Fully Probabilistic Design for Stochastic Discrete System with Multiplicative Noise

Zhou

2019 IEEE 15th International Conference on Control and Automation (ICCA)

Zafar

2019

Self Cite

“…The minimisation of the cost-to-go function (3) with respect to control law, c(u t | x t−1 ) is shown in [8,9] to be given by…”

Section: Probabilistic Control Objectivementioning

confidence: 99%

Generalised Probabilistic Control Design for Uncertain Stochastic Control Systems

Asian Journal of Control

2018

Self Cite

In this paper a novel generalised fully probabilistic controller design for the minimisation of the Kullback-Leibler divergence between the actual joint probability density function (pdf) of the closed loop control system, and an ideal joint pdf is presented for a linear Gaussian uncertain class of stochastic systems. A single layer neural network is used to approximate the probability density function of the system dynamics. The generalised probabilistic control law is obtained by solving the recurrence equation of dynamic programming to the fully probabilistic design control problem while taking into consideration the dependency of the parameters of the estimated probability density function of the system dynamics on the input values. It is shown to be of the class of cautious type controllers which accurately minimises the value of the Kullback-Leibler divergence without disregarding the variance of the model prediction as an element to be minimised. Comparison of theoretical and numerical results obtained from the F-16 fighter aircraft application with existing state-of-the-art demonstrates the effectiveness of the proposed method.

“…The pdf of optimal controller, c * (u t |x t ), minimizing the cost-to-go function (6) is determined by the following backward recursion, see e.g. Herzallah & Kárný (2011);Herzallah (2013)…”

Section: Preliminariesmentioning

confidence: 99%

Towards probabilistic synchronisation of local controllers

International Journal of Systems Science

Kárný

2016

Self Cite

The traditional use of global and centralised control methods, fails for large, complex, noisy and highly connected systems, which typify many real world industrial and commercial systems. This paper provides an efficient bottom up design of distributed control in which many simple components communicate and cooperate to achieve a joint system goal. Each component acts individually so as to maximise personal utility whilst obtaining probabilistic information on the global system merely through local message-passing. This leads to an implied scalable and collective control strategy for complex dynamical systems, without the problems of global centralised control. Robustness is addressed by employing a fully probabilistic design, which can cope with inherent uncertainties, can be implemented adaptively and opens a systematic rich way to information sharing. This paper opens the foreseen direction and inspects the proposed design on a linearised version of coupled map lattice with spatiotemporal chaos. A version close to linear quadratic design gives an initial insight into possible behaviours of such networks.