2012
DOI: 10.1016/j.arcontrol.2012.03.004

Reinforcement learning and optimal adaptive control: An overview and implementation examples

Cited by 182 publications (69 citation statements)
References 57 publications
“…The ADP technology is used widely in aircraft [25], robot arm [33], Micro-electromechanical-system actuator [34] and turbocharged diesel engine [21] and so on. At present, we just study our method in theory.…”
Section: Results (mentioning)
confidence: 99%
“…The stability analysis is barely objective but the optimum controller is usually preferred [6] for several practical control systems. By solving the nonlinear Hamilton-Jacobi-Bellman (HJB), adaptive dynamic programming (ADP) schemes had been developed to minimize an infinite cost function for both the error signal and the control effort [7].…”
Section: Introduction (mentioning)
confidence: 99%
“…As a major control technique of handling parametric uncertainties, adaptive control still remains among the most active research fields in the control community. [1][2][3][4][5][6][7] Usually, adaptive control only achieves asymptotic convergence of tracking errors and does not guarantee convergence of parameter estimation errors without a condition termed persistent excitation (PE). 8 Parameter convergence in adaptive control is desirable as it enhances the overall stability and robustness properties of the closed-loop system, 9 where the robustness properties result from closed-loop exponential stability, and detailed analysis can be referred to the work of Sastry and Bodson.…”
Section: Introduction (mentioning)
confidence: 99%