2019
DOI: 10.3390/app9091807
|View full text |Cite
|
Sign up to set email alerts
|

Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

Abstract: This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Ref… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
17
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 42 publications
(18 citation statements)
references
References 57 publications
0
17
0
1
Order By: Relevance
“…Compared to the methods in [32,33], the whole operation process is analyzed in this paper. Reinforcement learning based algorithms were employed to solve optimal control problems in [34]. An adaptive actor-critic learning framework is carefully designed with the virtual reference feedback tuning.…”
Section: Related Workmentioning
confidence: 99%
“…Compared to the methods in [32,33], the whole operation process is analyzed in this paper. Reinforcement learning based algorithms were employed to solve optimal control problems in [34]. An adaptive actor-critic learning framework is carefully designed with the virtual reference feedback tuning.…”
Section: Related Workmentioning
confidence: 99%
“…It is noted that most of the schemes mentioned above need to establish neural networks to design controllers, which makes preparing the external testing signals and training processes inescapable. Recently, some useful results have been reported for unknown multiagent systems, such as Model-Free Adaptive Control (MFAC) [23]- [24], Q-Learning [25]- [27], Iterative Feedback Tuning (IFT) [28]- [29], Simultaneous Perturbation Stochastic Approximation (SPSA) [30]- [31], Iterative Learning Control (ILC) [32]- [38], Virtual Reference Feedback Tuning (VRFT) [39].…”
Section: Introductionmentioning
confidence: 99%
“…The influence of uncertainty on matrix calculation was solved by means of set membership degree method and band position representation method. In addition to the above documents, adaptive actor-critic data-driven model-free tracking reinforcement learning control based on virtual reference feedback tuning, a programmable logic controller (PLC)-based fractional water level control method, and a Takagi-Sugeno fuzzy model based on linear matrix inequality fault-tolerant control were also proposed [42][43][44].…”
Section: Introductionmentioning
confidence: 99%