2021 American Control Conference (ACC)
DOI: 10.23919/acc50511.2021.9483100
Reinforcement Learning based on Scenario-tree MPC for ASVs

Abstract: In this paper, we present the use of Reinforcement Learning (RL) based on Robust Model Predictive Control (RMPC) for the control of an Autonomous Surface Vehicle (ASV). The RL-MPC strategy is used for obstacle avoidance and target (set-point) tracking. A scenario-tree robust MPC handles potential failures of the ship thrusters. In addition, wind and ocean currents are treated as unknown stochastic disturbances in the real system and are handled via constraint tightening. The tightening and …
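The scenario-tree idea in the abstract can be illustrated with a minimal sketch: each thruster-failure hypothesis becomes a branch of the tree, and non-anticipativity forces all branches to share the same first control input. The model, gains, and weights below are hypothetical placeholders (a scalar surge model with a nominal and a degraded thruster gain), not the paper's actual ASV model; the unconstrained problem is solved as one least-squares system rather than the paper's full robust MPC.

```python
import numpy as np

def scenario_tree_mpc(x0, horizon=5, gains=(1.0, 0.5), a=1.0, q=1.0, r=0.1):
    """Minimal scenario-tree MPC sketch for a scalar model
    x_{k+1} = a*x_k + b_s*u_k, where each scenario s has its own
    thruster gain b_s (e.g. nominal vs. partially failed thruster).
    The first input u_0 is shared across all scenarios
    (non-anticipativity); inputs at later stages branch per scenario.
    Solved as one unconstrained least-squares problem; returns u_0."""
    n_s = len(gains)
    n_var = 1 + (horizon - 1) * n_s          # shared u0 + branched inputs
    rows, rhs = [], []

    def u_index(s, k):                       # variable index of u_k in scenario s
        return 0 if k == 0 else 1 + s * (horizon - 1) + (k - 1)

    for s, b in enumerate(gains):
        for k in range(1, horizon + 1):
            # x_k = a^k x0 + sum_{j<k} a^(k-1-j) b u_j ; penalize sqrt(q)*x_k
            row = np.zeros(n_var)
            for j in range(k):
                row[u_index(s, j)] += np.sqrt(q) * a ** (k - 1 - j) * b
            rows.append(row)
            rhs.append(-np.sqrt(q) * a ** k * x0)
        for k in range(horizon):             # penalize sqrt(r)*u_k
            row = np.zeros(n_var)
            row[u_index(s, k)] = np.sqrt(r)
            rows.append(row)
            rhs.append(0.0)

    z, *_ = np.linalg.lstsq(np.asarray(rows), np.asarray(rhs), rcond=None)
    return z[0]                              # shared first input only

u0 = scenario_tree_mpc(2.0)                  # positive state -> negative thrust
```

Because the first input must hedge against both the nominal and the degraded thruster, it differs from what either single-scenario controller would apply; this is the robustness mechanism the scenario tree provides.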

Cited by 15 publications (1 citation statement) | References 18 publications
“…Model predictive control (MPC) has attained remarkable success in recent decades, because of its disturbance rejection (Draeger et al, 1995) and constraint handling (Morari and Lee, 1999) capabilities. It has been widely applied in various fields (Darby and Nikolaou, 2012), such as robotics, process control and reinforcement learning (Chua et al, 2018; Kordabad et al, 2021; Pfrommer et al, 2022). However, the performance of the MPC controllers can be degraded by a series of factors, including an uncertain system model (Piga et al, 2019), a limited terminal set (Rosolia and Borrelli, 2017, 2018), or an inappropriate objective function (Marco et al, 2016).…”
Section: Introduction
confidence: 99%