Steady-State Error Compensation in Reference Tracking and Disturbance Rejection Problems for Reinforcement Learning-Based Control

Weber, Daniel; Schenke, Maximilian; Wallscheid, Oliver

doi:10.48550/arxiv.2201.13331

Cited by 1 publication

(1 citation statement)

References 22 publications


“…[16] The authors of this study also implemented steady-state error compensation as a means of enhancing the efficacy of the DDPG agent.[17] Actor-critic methods such as DDPG are in general reinforcement learning techniques in which two neural networks, named "actor" and "critic" are present.[18,19] The actor learns a policy that is modeled by a parameterized distribution, while the critic learns either a value function (i.e., a function V(s) that provides the expected return when the initial state is s and a given policy is applied) or an action-value function (i.e., a function Q(s, a) that provides the expected return obtained when the initial state is s, an arbitrary action a which may not have been obtained from the given policy is taken at s, while the given policy is used for all subsequent discrete time instants) and uses it to evaluate the performance of the policy optimized by the actor.…”

Section: Introduction (mentioning)

confidence: 99%

Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)

Zholtayev; Rubagotti

2024

Optim Control Appl Methods


This article proposes the use of a deep reinforcement learning (DRL) method, specifically twin delayed DDPG (TD3), a variant of the deep deterministic policy gradient (DDPG) method, for maximum power point tracking in wind energy conversion systems that use permanent magnet synchronous generators (PMSGs). An overview of the TD3 algorithm is provided, together with a detailed description of its implementation and training for the considered application. Simulation results are provided, including a comparison with a model-based control method based on feedback linearization and linear-quadratic regulation. The proposed TD3-based controller achieves satisfactory control performance and is more robust to PMSG parameter variations than the presented model-based method. To the best of the authors' knowledge, this article presents for the first time an approach for generating both speed and current control loops using DRL for wind energy conversion systems.
