2020
DOI: 10.1109/access.2020.2983606
|View full text |Cite
|
Sign up to set email alerts
|

Deep Q-Network Based Energy Scheduling in Retail Energy Market

Abstract: As a significant component of electric energy trades, retail electric market (REM) can effectively alleviate the pressure of load demand from the power grid. However, the load demand uncertainty of customers becomes a nodus because retail electricity providers (REPs) should predict the load demand when trading with wholesaler electricity provider (WEP) based on the interaction. Therefore, in this paper, we propose an optimal energy scheduling scheme in REM with consideration of the influence of decisions made … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 33 publications
0
3
0
Order By: Relevance
“…In reinforcement learning, an autonomous agent interacts with the environment through its own experience, [39] observes the state of the environment to take action, and learns its own behaviors with the help of feedback from the environment to maximize the cumulative rewards. These behaviors tend to become more optimal as the agent gains experience.…”
Section: Attention Mechanismmentioning
confidence: 99%
See 1 more Smart Citation
“…In reinforcement learning, an autonomous agent interacts with the environment through its own experience, [39] observes the state of the environment to take action, and learns its own behaviors with the help of feedback from the environment to maximize the cumulative rewards. These behaviors tend to become more optimal as the agent gains experience.…”
Section: Attention Mechanismmentioning
confidence: 99%
“…These behaviors tend to become more optimal as the agent gains experience. [39,40] This can be solved by employing a Markov decision process [41] consisting of five different entities: state, action, reward, policy, and value. Reinforcement learning can be represented by a quaternion ⟨S, A, P, R⟩, where S is the set of states, A is the set of actions, P is the state transfer function, and R is the reward function.…”
Section: Attention Mechanismmentioning
confidence: 99%
“…The agent operates on the environment and performs a series of actions to solve certain problems. [37,38] Each time step t can be represented by a state s t ∈ S, where S is the state space, showing how the environment is represented. The feedback loop between the agent and the environment is called a Markov decision process (MDP).…”
Section: Reinforcement Learningmentioning
confidence: 99%