2020
DOI: 10.1109/tsg.2020.2986333
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning-Based Energy Storage Arbitrage With Accurate Lithium-Ion Battery Degradation Model

Abstract: Accurate estimation of battery degradation cost is one of the main barriers for battery participating on the energy arbitrage market. This paper addresses this problem by using a model-free deep reinforcement learning (DRL) method to optimize the battery energy arbitrage considering an accurate battery degradation model. Firstly, the control problem is formulated as a Markov Decision Process (MDP). Then a noisy network based deep reinforcement learning approach is proposed to learn an optimized control policy … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
104
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
7
1
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 215 publications
(104 citation statements)
references
References 23 publications
(36 reference statements)
0
104
0
Order By: Relevance
“…Hence, we remove the dependence on the model-based EKF approach. The authors of [29] also highlighted the effectiveness of reinforcement learning in designing an optimal control policy to reduce transmission losses.…”
Section: Related Workmentioning
confidence: 99%
“…Hence, we remove the dependence on the model-based EKF approach. The authors of [29] also highlighted the effectiveness of reinforcement learning in designing an optimal control policy to reduce transmission losses.…”
Section: Related Workmentioning
confidence: 99%
“…al. [28], is to add future prices to the state. Thus, for the FQI-based controller no additional adaptations are needed to the state, but every training day the rewards in F need to be recalculated.…”
Section: B Belpex Pricementioning
confidence: 99%
“…Model-free RL ConvLSTM+DQN 2020 [142] Energy storage arbitrage A battery energy arbitrage is modelled to learn optimal control policy for storage charging/discharging strategy via a battery degradation mode ConvLSTM for price prediction DDQN 2020 [143] Battery management The optimal operation of the community battery is controlled by the double deep Q-learning method.…”
Section: Milpmentioning
confidence: 99%