2021
DOI: 10.3389/frobt.2021.733104
|View full text |Cite
|
Sign up to set email alerts
|

Safe Model-Based Reinforcement Learning for Systems With Parametric Uncertainties

Abstract: Reinforcement learning has been established over the past decade as an effective tool to find optimal control policies for dynamical systems, with recent focus on approaches that guarantee safety during the learning and/or execution phases. In general, safety guarantees are critical in reinforcement learning when the system is safety-critical and/or task restarts are not practically feasible. In optimal control theory, safety requirements are often expressed in terms of state and/or control constraints. In rec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
9
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 34 publications
0
9
0
Order By: Relevance
“…In recent years, significant progress has been made in developing safe model-based reinforcement learning (SMBRL) techniques to learn safe controllers for different classes of systems. [6][7][8][9][10][11][12][13][14][15][16] While Markov decision process (MDP) based SMBRL methods have been available for discrete time systems with finite state and action spaces, [6][7][8][9] synthesizing online controllers for systems in continuous time, under output feedback, while guaranteeing stability and safety is still a challenging problem.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations
“…In recent years, significant progress has been made in developing safe model-based reinforcement learning (SMBRL) techniques to learn safe controllers for different classes of systems. [6][7][8][9][10][11][12][13][14][15][16] While Markov decision process (MDP) based SMBRL methods have been available for discrete time systems with finite state and action spaces, [6][7][8][9] synthesizing online controllers for systems in continuous time, under output feedback, while guaranteeing stability and safety is still a challenging problem.…”
Section: Introductionmentioning
confidence: 99%
“…While the control barrier function results in safety guarantees, the existence of a smooth value function, in spite of a nonsmooth cost function, needs to be assumed. 16 This article is inspired by the nonlinear coordinate transformation first introduced in Reference 18. Leveraging the results of Reference 18, a barrier transformation (BT) to construct an equivalent, unconstrained optimal control problem from a state-constrained optimal control problem was introduced in Reference 14.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Introducing theoretical guarantees on reinforcement learning methods is an active area of research. 10 The problem of defining a search termination criterion like we seek in the multi-region scenarios appears novel in the problem space.…”
Section: Introductionmentioning
confidence: 99%