2022
DOI: 10.1109/tsg.2022.3158814
|View full text |Cite
|
Sign up to set email alerts
|

Home Energy Recommendation System (HERS): A Deep Reinforcement Learning Method Based on Residents’ Feedback and Activity

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 37 publications
(8 citation statements)
references
References 39 publications
0
8
0
Order By: Relevance
“…For this challenging problem with continuous state space and high-dimensional action space, policy gradient RL methods provide an effective solution approach. Specifically, the Advantage Actor-Critic (A2C) algorithm is known to provide quick convergence in such problems [4], [18], [19], [22], [26], [27]. Using two neural networks (actor and critic) A2C reduces the variance of its predecessor policy gradient algorithm the REINFORCE [26].…”
Section: F Solution Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…For this challenging problem with continuous state space and high-dimensional action space, policy gradient RL methods provide an effective solution approach. Specifically, the Advantage Actor-Critic (A2C) algorithm is known to provide quick convergence in such problems [4], [18], [19], [22], [26], [27]. Using two neural networks (actor and critic) A2C reduces the variance of its predecessor policy gradient algorithm the REINFORCE [26].…”
Section: F Solution Approachmentioning
confidence: 99%
“…Numerous existing works proposed scheduling techniques for the time shiftable appliances (e.g., Dishwasher, washer dryer, EV charging, etc.) of a household to capitalize the TOU tariffs [4]- [6]. Although such DSM techniques can flatten the demand curve to an extent, they do not sufficiently address the increasing stress of EV charging on the distribution transformers since they lack the utility-side management of the problem.…”
Section: Introductionmentioning
confidence: 99%
“…In [88], the building is regarded as a whole, and action is the temperature change from the original schedule. In [91] and [92], RL-based HEM methods are proposed. Environment contains models of various appliances and interaction rules with the grid.…”
Section: Intelligent Load Managementmentioning
confidence: 99%
“…Yu et al (2019) suggested also a model using MDP to schedule optimally HVAC appliances and the energy storage system of a smart home. Finally, Shuvo and Yilmaz (2022), proposed a DFL model that incorporated human feedback in the objective function and human activity data in the reinforcement learning part of it to enhance optimization of energy.…”
Section: Deep Learningmentioning
confidence: 99%