2018
DOI: 10.1002/acs.2949
|View full text |Cite
|
Sign up to set email alerts
|

A set‐based model‐free reinforcement learning design technique for nonlinear systems

Abstract: In this study, we propose an extremum-seeking approach for the approximation of optimal control problems for a class of unknown nonlinear dynamical systems. The technique combines a phasor extremum-seeking controller with a reinforcement learning strategy. The learning approach is used to estimate the value function of an optimal control problem of interest. The phasor extremum-seeking controller implements the approximate optimal controller. The approach is shown to provide reasonable approximations of optima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…The incorporation of deep neural networks (DNN) as function approximators into the RL framework is known as deep reinforcement learning (DRL) . Complex control problems with discrete and continuous action spaces such as robotic manipulation, bipedal locomotion, and tracking control problems have been successfully solved using DRL [10,11,12,13,14]. Inspired by the ideas in RL, in this paper we explore the implementation of a non-invasive, full-online and model-free outer-loop control using DRL in the context of tracking control problems.…”
Section: Introductionmentioning
confidence: 99%
“…The incorporation of deep neural networks (DNN) as function approximators into the RL framework is known as deep reinforcement learning (DRL) . Complex control problems with discrete and continuous action spaces such as robotic manipulation, bipedal locomotion, and tracking control problems have been successfully solved using DRL [10,11,12,13,14]. Inspired by the ideas in RL, in this paper we explore the implementation of a non-invasive, full-online and model-free outer-loop control using DRL in the context of tracking control problems.…”
Section: Introductionmentioning
confidence: 99%
“…The model is assumed to be partially known, and an integral reinforcement learning approach is used to iteratively learn the solution of the associated algebraic Riccati equation online. The problem of optimal control over an infinite time horizon for an uncertain nonlinear system, affine in the control, is studied in the paper of Guay and Khalid . The idea proposed in this paper is based on the merger of reinforcement learning used to estimate the optimal cost function and an extremum seeker used to estimate the optimal control.…”
Section: Introductionmentioning
confidence: 99%