In this paper, a new value iteration (VI) based method is proposed to solve the H ∞ control problem of continuous-time linear systems. The problem is transformed into solving a nonlinear differential equation, and the local stability of its solution is proved. Then a VI based iteration scheme for getting the approximation of the H ∞ optimal controller is proposed. Compared with the existing results, the proposed scheme does not need any special initial matrix or extra condition to solve the problem and provides a relatively small disturbance attenuation bound. The related result is also applied to the quadratic guaranteed cost control of linear time-delay systems. Simulation examples verify the effectiveness of the proposed schemes.
K E Y W O R D Sadaptive dynamic programming, linear H ∞ control, model free control, time-delay systems, zero-sum differential games
INTRODUCTIONOver the past few decades, the analysis and control of systems with disturbance and complex dynamics have received considerable attention in the control community. 1,2 As one of the most popular methods, H ∞ control, which studies the problem of the worst-case controller design, [3][4][5] was established and flourished. One way for solving the H ∞ control problem is to treat it as an infinite-horizon zero-sum game problem where the input of the system tries to minimize the infinite quadratic cost function and the disturbance tries to maximize it. Especially, by treating the delayed state as the disturbance, the H ∞ optimal control method can also solve the quadratic cost guaranteed control of systems with state delays. 6 The synthesis of the H ∞ controller needs to solve the Hamilton-Jacobi-Isaacs equations for nonlinear systems 7 and the game algebraic Riccati equation (GARE) for linear systems. 8 For linear systems, solving the GARE can be divided into two categories: the direct method 9,10 and the iterative method. 11,12 Both methods mentioned above need the exact system information of the original systems, which is however hard to be satisfied since in practice the system parameters usually contain uncertainties and thus are unknown. Therefore, how to solve the H ∞ optimal control problems without using the exact system parameters has attracted much attention in the literature.One way for dealing with the system uncertainties is to use the adaptive dynamic programming (ADP) method. In recent years, ADP has attracted more and more attention from both control theorists and engineers. It overcomes the curse of dimensionality 13,14 and acts as a promising methodology for solving the optimal control problem. [15][16][17][18] By ADP,