Optimal Output Feedback Control of Nonlinear Partially-Unknown Constrained-Input Systems Using Integral Reinforcement Learning

Ren, Ling; Zhang, Guoshan; Mu, Chaoxu

doi:10.1007/s11063-019-10072-2

Cited by 8 publications

(11 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Proof The proof that the state estimation error

\tilde{x}

and the NN weight estimation error

{\tilde{W}}_f

are UUB does not much differ from that of [17, 19] and thus it is omitted here for brevity. □…”

Section: Neural Network Observermentioning

confidence: 99%

“…In [4], an adaptive synchronous PI algorithm with actor-critic architecture is developed, which can adjust the critic and actor NNs simultaneously and can be called synchronous IRL (SIRL). In [17], SIRL algorithm based on neural network observer (NNO) is designed to solve HJB equation for the nonlinear system with the unknown drift dynamics and unmeasurable state. In practical application, if the input saturation of the actuator is not considered, the designed controller may lead to system instability or worse stability, so more and more researchers have studied the problem of the actuator saturation in the design of optimal control algorithms [12,18,19].…”

Section: Introductionmentioning

confidence: 99%

“…Compared with the dynamics identification method proposed in [11], we adopt the NNO to estimate unmeasurable system state online only by the input-output data of the system, which has good practical application value. Compared with the time-triggered control schemes mentioned above, such as [16][17][18][19], the event-triggered-based control algorithm proposed in this paper can reduce computation and communication burden and can reduce the update frequency of the controller. Compared with ADP-based event-triggered control approaches proposed in [27,28], the prior knowledge of drift dynamics in this paper is relaxed, and the system state is regarded as unmeasurable.…”

Section: Introductionmentioning

confidence: 99%

“…RL based on policy iteration (PI) technology is an effective method to deal with the optimization problems, and PI technology is implemented by alternating actions of policy evaluation and policy improvement with critic–actor architecture, where two kinds of neural networks (NNs) as the critic NN and the actor NN are used to approximate the optimal cost function and optimal control policy, respectively [14, 15]. As an improvement of RL algorithm, integral RL (IRL) has been investigated in [4, 16, 17]. In [16], a novel PI algorithm with critic‐actor architecture considered as IRL is proposed to solve the optimal control problem by introducing integral Bellman equation that the knowledge of system internal dynamic is no longer required, and the integral term in policy evaluation step can be addressed as the reinforcement signal over the time interval

\left[t,t&#x0002B;T\right)

.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Event‐triggered‐based integral reinforcement learning output feedback optimal control for partially unknown constrained‐input nonlinear systems

Zou

Zhang

2023

Asian Journal of Control

Self Cite

View full text Add to dashboard Cite

In this paper, an adaptive output feedback event‐triggered optimal control algorithm is proposed for partially unknown constrained‐input continuous‐time nonlinear systems. First, a neural network observer is constructed to estimate unmeasurable state. Next, an event‐triggered condition is established, and only when the event‐triggered condition is violated will the event be triggered and the state be sampled. Then, an event‐triggered‐based synchronous integral reinforcement learning (ET‐SIRL) control algorithm with critic‐actor neural networks (NNs) architecture is proposed to solve the event‐triggered Hamilton–Jacobi–Bellman equation under the established event‐triggered condition. The critic and actor NNs are used to approximate cost function and optimal event‐triggered optimal control law, respectively. Meanwhile, the event‐triggered‐based closed‐loop system state and all the neural network weight estimation errors are uniformly ultimately bounded proved by Lyapunov stability theory, and there is no Zeno behavior. Finally, two numerical examples are presented to show the effectiveness of the proposed ET‐SIRL control algorithm.

show abstract

“…Proof The proof that the state estimation error

\tilde{x}

and the NN weight estimation error

{\tilde{W}}_f

are UUB does not much differ from that of [17, 19] and thus it is omitted here for brevity. □…”

Section: Neural Network Observermentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

\left[t,t&#x0002B;T\right)

.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Event‐triggered‐based integral reinforcement learning output feedback optimal control for partially unknown constrained‐input nonlinear systems

Zou

Zhang

2023

Asian Journal of Control

Self Cite

View full text Add to dashboard Cite

show abstract

“…Although control-input constraints (CICs) have been considered in some AFS studies Ko et al, 2002;Viswamurthy and Ganguli, 2008;Wang et al, 2011), none of the existing solutions address the problem in the sense of optimal control. Despite numerous methods of nonlinear optimal control online synthesis (NOCOS) for systems with CICs being available (Liang et al, 2019;Na et al, 2019;Ren et al, 2019), these methods are inapplicable to AFS because of problems related to stability, application scope, and real-time implementation. Moreover, these methods are limited to locally parameter-invariant nonlinear systems, whereas aeroelastic systems are parameter varying as the dynamics also change nonlinearly with the freestream airflow speed.…”

Section: Introductionmentioning

confidence: 99%

A neural network approach for improving airfoil active flutter suppression under control-input constraints

Tang

Chen

Tian

et al. 2020

Journal of Vibration and Control

View full text Add to dashboard Cite

This study deals with improving airfoil active flutter suppression under control-input constraints from the optimal control perspective by proposing a novel optimal neural-network control. The proposed approach uses a modified value function approximation dynamically tuned by an extended Kalman filter to solve the Hamilton–Jacobi–Bellman equality online for continuously improved optimal control to address optimality in parameter-varying nonlinear systems. Control-input constraints are integrated into the controller synthesis by introducing a generalized nonquadratic cost function for control inputs. The feasibility of using a performance index involving the nonquadratic control-input cost with the modified value function approximation is examined through the Lyapunov stability analysis. Wind tunnel experiments were conducted for controller validation, where an optimal controller synthesized offline via linear parameter-varying technique was used as a benchmark and compared. It is shown, both theoretically and experimentally, that the proposed method can effectively improve airfoil active flutter suppression under control-input constraints.

show abstract

Optimal control of partially unknown constrained‐input systems: A dynamic event‐triggered‐based approach

Zou

Zhang

2023

Optim Control Appl Methods

Self Cite

View full text Add to dashboard Cite

This article presents an identifier‐based dynamic event‐triggered optimal control scheme for partially unknown constrained‐input systems. First, an event‐triggered‐based neural network (NN) identifier is constructed to estimate the unknown system dynamics. Then, an adaptive dynamic programming algorithm with actor‐critic NN structure is adopted to obtain an approximate solution of the Hamilton–Jacobi–Bellman equation. The above considers that transmitted measurements are only available at the triggering instants, and the update of all three NN weights depends on the established dynamic event‐triggered mechanism. Different from existing static event‐triggered mechanism, the proposed dynamic event‐triggered mechanism can further obtain a reasonable trade‐off between performance and communication resources by introducing a dynamic variable, and the Zeno behavior can be excluded by devising an exponential term. It is proved that all the closed‐loop system signals are uniformly ultimately bounded under the established event‐triggered mechanism. Finally, two numerical examples are provided, including the spring‐mass‐damper system, to validate the proposed control scheme.

show abstract

Optimal Output Feedback Control of Nonlinear Partially-Unknown Constrained-Input Systems Using Integral Reinforcement Learning

Cited by 8 publications

References 35 publications

Event‐triggered‐based integral reinforcement learning output feedback optimal control for partially unknown constrained‐input nonlinear systems

Event‐triggered‐based integral reinforcement learning output feedback optimal control for partially unknown constrained‐input nonlinear systems

A neural network approach for improving airfoil active flutter suppression under control-input constraints

Optimal control of partially unknown constrained‐input systems: A dynamic event‐triggered‐based approach

Contact Info

Product

Resources

About