2019
DOI: 10.1007/s00521-019-04180-2
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Abstract: providing relevant details, so we can investigate your claim. Download date:03. Nov. 2020 Abstract-In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
33
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 51 publications
(33 citation statements)
references
References 47 publications
0
33
0
Order By: Relevance
“…, A i,1 > 0 and A i,2 > 0 are design parameters, (12) and (13) indicate that when the triggering condition E i ≥ A i,1 z i,2 + A i,2 is satisfied, the input τ i updates its value at t i,a+1 and then is set toτ i (t i,a ) during t ∈ [t i,a , t i,a+1 ). Here, the signalτ i for the event-triggered update of the control input is defined as…”
Section: ϕ In I ] Is a Radial Basis Function Vector And I Is A mentioning
confidence: 99%
See 3 more Smart Citations
“…, A i,1 > 0 and A i,2 > 0 are design parameters, (12) and (13) indicate that when the triggering condition E i ≥ A i,1 z i,2 + A i,2 is satisfied, the input τ i updates its value at t i,a+1 and then is set toτ i (t i,a ) during t ∈ [t i,a , t i,a+1 ). Here, the signalτ i for the event-triggered update of the control input is defined as…”
Section: ϕ In I ] Is a Radial Basis Function Vector And I Is A mentioning
confidence: 99%
“…Remark 3: Compared with the existing works [8]- [10], [14], [15] for single stratospheric airships, the proposed control scheme (13)-(16) deals with the distributed formation control problem for networked multiple stratospheric airships and the local controller (12) updates its value according to the proposed asynchronous triggering rule (13). Although the local controllers (12) are intermittently updated and the leader output signal is only available for some of the followers, the stability of the resulting formation control system can be ensured, as presented in the following section.…”
Section: ϕ In I ] Is a Radial Basis Function Vector And I Is A mentioning
confidence: 99%
See 2 more Smart Citations
“…A lowpass filter was introduced to improve the system damping, but the effect of different grid-connected positions and capacity on the small signal of the system was not considered. At present, neural networks [16,17], deep learning, and machine learning methods have been widely used in various fields, and remarkable results have been achieved [18,19].…”
Section: Introductionmentioning
confidence: 99%