2021
DOI: 10.1109/access.2021.3086002
|View full text |Cite
|
Sign up to set email alerts
|

Research on Bidding Strategy of Thermal Power Companies in Electricity Market Based on Multi-Agent Deep Deterministic Policy Gradient

Abstract: With the continuous improvement of new energy penetration in the power system, the price of the spot market of power frequently fluctuates greatly, which damages the income of a large number of thermal power enterprises. In order to lock in the profit, thermal power enterprises should turn the main target of profit to the medium and long-term power market. With the continuous advancement of the reform in China's power system, major changes have taken place in the medium and long-term power transactions, includ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…For instance, Wang et al (2019) formulated the sequential energy bidding problem as a Markov decision process (MDP) and utilized the classic Q-learning method to reflect personalized bidding preferences. Liu et al (2021) used the multi-agent deep deterministic policy gradient (MADDPG) algorithm to address the non-cooperative and cooperative energy trading games between power companies. Tao et al (2022) utilized the deep deterministic policy gradient (DDPG) method to generate the bidding price and bidding volume for an electric vehicle aggregator, which reduced the reliance of algorithm performance on the stochastic model.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, Wang et al (2019) formulated the sequential energy bidding problem as a Markov decision process (MDP) and utilized the classic Q-learning method to reflect personalized bidding preferences. Liu et al (2021) used the multi-agent deep deterministic policy gradient (MADDPG) algorithm to address the non-cooperative and cooperative energy trading games between power companies. Tao et al (2022) utilized the deep deterministic policy gradient (DDPG) method to generate the bidding price and bidding volume for an electric vehicle aggregator, which reduced the reliance of algorithm performance on the stochastic model.…”
Section: Related Workmentioning
confidence: 99%
“…They use the DQN reinforcement learning algorithm to maximise profit in peer-to-peer energy trading. Liu et al [42] also investigate the peerto-peer trading problem in a local energy market. They find that the community modelled are able to increase profitability through the use of DQN RL.…”
Section: Literature Review 41 Reinforcement Learningmentioning
confidence: 99%
“…Market Type Application Algorithm Used 2021 Bose S. [6] Local energy market Peer to peer trading Roth-Erev 2021 Naseri N. [55] Local energy market Tariff design SARIMAX, MDP 2021 Tang C. [78] International/National Tariff design Novel WoLF-PHC 2021 Liu D. [42] International/National Bidding strategies, Peerto-Peer MADDPG 2021 Deng C. [12] International/National Demand response DDPG 2021 Viehmann J. [82] International/National Market investigation Q-Learning 2020 Tomin N. [80] Microgrid Electricity grid control Q-Learning 2020 Liang Y.…”
Section: Year First Authormentioning
confidence: 99%
“…Furthermore, from the standpoint of power suppliers and consumers, strategic bidder faces difficulties, this category of transformation is also rationalized by the existing condition as such an unpreventable social assistance necessity [7,8]. Researchers develop bidding strategies for power provider organizations [10], facing challenges due to uncertain demand and rival actions, addressing various timelines [11]. In a perfect electricity market, power suppliers bid primarily on marginal cost.…”
Section: Introductionmentioning
confidence: 99%