2022
DOI: 10.1063/5.0097344
|View full text |Cite
|
Sign up to set email alerts
|

Attention and masking embedded ensemble reinforcement learning for smart energy optimization and risk evaluation under uncertainties

Abstract: Integrating residential-level photovoltaic energy generation and energy storage for the on-grid system are essential to reduce electricity use for residential consumption from the grid. However, reaching a reliable and optimal control policy is highly challenging due to the intrinsic uncertainties in the renewable energy sources and fluctuating demand profile. In this work, we proposed and designed an ensemble deep reinforcement learning (DRL) algorithm combined with risk evaluation to solve the energy optimiz… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…Network: TCN, BiLSTM, KELM, BPNN, MLP, echo state network (ESN), Elman neural network (ENN), and gradient boosting decision tree (GBDT); Training algorithm: SARSA Qin et al [85] historical load data of the California Independent System Operator (CASIO) from January 1, 2021 to July 5, 2021 PPO guided tree search, the MIQP algorithm with Gurobi 9.1 Sogabe et al [86] optimal energy management in a residential building microgrid mixed-integer linear programming (MILP) Birman et al [60] a range of real-world scenarios Aggregation method Li et al [51] PHEME, RumorEval Naive Bayes, SVM-SGD, Dense, BiLSTM, FastText, TextCNN, VRoC, some combinations of above methods Sharma et al [52] two MEC servers and 30 IoTDs randomly distributed in the squared area with size 50m×50m…”
Section: Datasets and Compared Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Network: TCN, BiLSTM, KELM, BPNN, MLP, echo state network (ESN), Elman neural network (ENN), and gradient boosting decision tree (GBDT); Training algorithm: SARSA Qin et al [85] historical load data of the California Independent System Operator (CASIO) from January 1, 2021 to July 5, 2021 PPO guided tree search, the MIQP algorithm with Gurobi 9.1 Sogabe et al [86] optimal energy management in a residential building microgrid mixed-integer linear programming (MILP) Birman et al [60] a range of real-world scenarios Aggregation method Li et al [51] PHEME, RumorEval Naive Bayes, SVM-SGD, Dense, BiLSTM, FastText, TextCNN, VRoC, some combinations of above methods Sharma et al [52] two MEC servers and 30 IoTDs randomly distributed in the squared area with size 50m×50m…”
Section: Datasets and Compared Methodsmentioning
confidence: 99%
“…Compared with traditional studies using machine learning (ML) or artificial neural network (ANN) prediction methods [79,80], there is a significant gap between the prediction results obtained by these methods and ERL. Liu [83] wind speed prediction deep Q-Network 2021 Jalali et al [84] wind power forecasting Q-learning 2022 Tan et al [31] PM2.5 prediction Sarsa 2022 Qin et al [85] unit commitment problem deep Q-Network 2022 Sogabe et al [86] smart energy optimization and risk evaluation Q-learning 2022 Sharma et al [52] estimating reference evapotranspiration Q-learning 2022 He et al [87] wind farm control deep deterministic policy gradient 2022 Jalali et al [88] solar irradiance forecasting Q-learning 2022 Shi Yin and Hui Liu [61] wind power prediction Q-learning 2023 Yu et al [29] wind power prediction deep deterministic policy gradient…”
Section: Energy and Environment Areamentioning
confidence: 99%
“…Here, T(s t+1 |s t , a t ) is the stochastic transition probability and will be given in more detail in Section 2.4. The integrated final visiting frequency of specified state s i in P(ξ i |θ) under a given trajectory length T for all s t is given as follows and will be used in formula (17) to update the learning parameter θ.…”
Section: Mathematical Problems In Engineeringmentioning
confidence: 99%
“…It is a conventional approach used in financial engineering field. Another one is the test demonstration of the trained model on the data that are not included in the training data, which is a standard measure used in the field of machine learning [17].…”
Section: Risk Evaluation and Policy Verificationmentioning
confidence: 99%
See 1 more Smart Citation