Proceedings of the 28th IEEE Conference on Decision and Control
DOI: 10.1109/cdc.1989.70114
|View full text |Cite
|
Sign up to set email alerts
|

Neural networks for control and system identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
150
0
1

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 261 publications
(151 citation statements)
references
References 5 publications
0
150
0
1
Order By: Relevance
“…The conditions (13), (14), and (15) are used to propose the RL solution framework as will be explained in the next section.…”
Section: B Hamiltonian Dynamicsmentioning
confidence: 99%
See 1 more Smart Citation
“…The conditions (13), (14), and (15) are used to propose the RL solution framework as will be explained in the next section.…”
Section: B Hamiltonian Dynamicsmentioning
confidence: 99%
“…Following this assessment, the actor and critic weights are updated [12], [13]. The adaptive critics are used to solve the optimal control problem in realtime in [14]. The optimal control problem finds the necessity optimality conditions and hence the optimal strategies [15].…”
Section: Introductionmentioning
confidence: 99%
“…This allows approximate solution of (10), (11). In this section, we review how to implement the HDP value iterations algorithm with two parametric structures such as neural networks (Werbos, 1990) and (Lewis & Jaganathan, 1999). The important point is stressed that the use of two NN, a critic for value function approximation and an action NN for the control, allows the implementation of HDP in the LQR case without knowing the system internal dynamics matrix A.…”
Section: Neural Network Approximation For Value and Actionmentioning
confidence: 99%
“…The HDP value iteration algorithm (Werbos, 1990) is a method to solve the DT HJB online. In this section, a proof of convergence of the HDP algorithm in the general nonlinear discrete-time setting is presented.…”
Section: The Hdp Algorithmmentioning
confidence: 99%
“…The first RL applications on control systems have been found on Werbos [19,20], where the regulation problem was tackled, whose objective is to design a controller for a given process, such that the internal state of this process approaches zero as time increases unbounded. Then, an immediate extension was to apply policy iteration (PI) algorithms to solve the linear quadratic regulator (LQR) problem [4].…”
Section: Introductionmentioning
confidence: 99%