2021
DOI: 10.3390/en14206695
|View full text |Cite
|
Sign up to set email alerts
|

Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control

Abstract: Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To add… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…Currently, the twin-delayed deep deterministic policy gradient (TD3) is the more popular algorithm in DRL [27] [28], which has been successfully applied in many different areas [29][30][31][32][33][34]. Given the time-varying characteristics in urban rail transit systems, we propose a modified TD3 algorithm by introducing a priority experience sampling strategy, termed A-TD3 (Annealing bias-priority experience replay twin delayed deep deterministic policy gradient algorithm).…”
Section: Introductionmentioning
confidence: 99%
“…Currently, the twin-delayed deep deterministic policy gradient (TD3) is the more popular algorithm in DRL [27] [28], which has been successfully applied in many different areas [29][30][31][32][33][34]. Given the time-varying characteristics in urban rail transit systems, we propose a modified TD3 algorithm by introducing a priority experience sampling strategy, termed A-TD3 (Annealing bias-priority experience replay twin delayed deep deterministic policy gradient algorithm).…”
Section: Introductionmentioning
confidence: 99%
“…Meanwhile, DRL is robust to the uncertainties of the system, i.e., control signals latency, environment noise, and load variations. A Deep Deterministic Policy Gradient (DDPG) based method and the twin-delayed DDPG method are proposed to overcome various communication delays during damping control [11], [12]. To solve the high dimensionality problem of power systems, Mukherjee et al [13] introduce two model reduction approaches for scalable DRL wide-area damping control.…”
Section: Introductionmentioning
confidence: 99%
“…Motivated by the previous investigations, this paper employs the DRL principle to design a parameter tuning model for a two-loop autopilot using the TD3 algorithm [33], which has the following striking advantages over existing autopilot design schemes:…”
Section: Introductionmentioning
confidence: 99%