2020
DOI: 10.1002/rnc.5058
|View full text |Cite
|
Sign up to set email alerts
|

Optimal control of a two‐wheeled self‐balancing robot by reinforcement learning

Abstract: Summary This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two‐wheeled self‐balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization in the face of parameter uncertainties. In practical applications, it is often desirable to realize optimal control in the absence of the precise knowledge of the system parameters. This article proposes to use a new… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(11 citation statements)
references
References 20 publications
0
11
0
Order By: Relevance
“…Next, combining inequalities (15) and (16) and using (10), it may be deduced that the estimation error x(t) and the adaptation error ρ(t) converge to a compact set whose radius may be reduced by choosing a sufficiently large value of the design parameter θ.…”
Section: Adaptive Observer For States and Terrain Inclinationmentioning
confidence: 99%
See 1 more Smart Citation
“…Next, combining inequalities (15) and (16) and using (10), it may be deduced that the estimation error x(t) and the adaptation error ρ(t) converge to a compact set whose radius may be reduced by choosing a sufficiently large value of the design parameter θ.…”
Section: Adaptive Observer For States and Terrain Inclinationmentioning
confidence: 99%
“…Furthermore, common networked control strategies have been implemented in [15] for stabilizing a two-wheeled inverted pendulum robot over a wireless channel despite time-varying delays and paket loss. Also, a new feedback reinforcement learning method was proposed in [16] to solve the LQR control problem for the two-wheeled self-balancing robot. e suggested method scheme was completely online and did not require any knowledge of the system parameters.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, reinforcement learning (RL) and approximate/adaptive dynamic programming (ADP) algorithms are widely applied to solve optimal control problems. [14][15][16][17][18][19] Moreover, for MJSs, the optimal controller can also be obtained through online learning by using RL and ADP algorithm. In He et al, 20 an online adaptive optimal control scheme is developed by a novel policy iteration algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…The disadvantage of this approach is the need for an explicit knowledge of the system model; therefore, model-free methods working based on the concept of reinforcement learning (RL) have been proposed in the literature. This method has been effectively used to solve the regulation problem for the nonlinear systems and learn the optimal control solution in real-time while cutting the need for the complete knowledge of the system dynamics [5]- [12].…”
Section: Introductionmentioning
confidence: 99%