2022
DOI: 10.1016/j.apenergy.2022.119543
|View full text |Cite
|
Sign up to set email alerts
|

A reinforcement and imitation learning method for pricing strategy of electricity retailer with customers’ flexibility

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…This parallel combination is mainly applied in robotics [28] and urban self-driving scenarios [29]. For applications in the power field, an expert replay buffer is introduced in [30] during the DRL training process to help an electricity retailer learn optimal pricing policy, but the expert policy stored in the expert replay memory is selected from strategies explored by the agent rather than directly using expert demonstrations through IL, which may reduce the efficiency of learning.…”
Section: Introductionmentioning
confidence: 99%
“…This parallel combination is mainly applied in robotics [28] and urban self-driving scenarios [29]. For applications in the power field, an expert replay buffer is introduced in [30] during the DRL training process to help an electricity retailer learn optimal pricing policy, but the expert policy stored in the expert replay memory is selected from strategies explored by the agent rather than directly using expert demonstrations through IL, which may reduce the efficiency of learning.…”
Section: Introductionmentioning
confidence: 99%