2023
DOI: 10.1016/j.apenergy.2022.120598
|View full text |Cite
|
Sign up to set email alerts
|

Online transfer learning strategy for enhancing the scalability and deployment of deep reinforcement learning control in smart buildings

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 45 publications
(13 citation statements)
references
References 72 publications
0
13
0
Order By: Relevance
“…where b occ,k is a boolean variable being 1 when occupants are present or 0 otherwise. A temperature violation T viol,k is calculated as the absolute temperature difference between the indoor temperature and the upper T i or lower limit T i , and can have different expressions depending on the value of the indoor temperature T i [9]:…”
Section: Methodsmentioning
confidence: 99%
“…where b occ,k is a boolean variable being 1 when occupants are present or 0 otherwise. A temperature violation T viol,k is calculated as the absolute temperature difference between the indoor temperature and the upper T i or lower limit T i , and can have different expressions depending on the value of the indoor temperature T i [9]:…”
Section: Methodsmentioning
confidence: 99%
“…The results suggested that transferring the DRL control policy from one building to another within the energy community yielded comparable performance while reducing the training costs. Coraci et al (2023b) developed an online transfer learning approach that exploits two knowledge-sharing techniques, weight-initialisation and IL, to transfer a DRL controller pre-trained on a source office building that minimises electricity cost while enhancing indoor temperature conditions by managing a cooling system. The proposed online transfer learning approach aims to replicate real-world implementation by simulating the transferred DRL agent in the target buildings for a single episode.…”
Section: Related Work On Tl Applications For Reinforcement Learning C...mentioning
confidence: 99%
“…Nevertheless, in the early stages of the training period, the agent possesses limited knowledge about the control problem, and there exists a significant risk that the chosen controller actions yield suboptimal performance. In this framework, the memory buffer of the online DRL agent is initialised with transitions acquired from the operation of the RBC, which is essentially an imitation learning approach (Coraci et al 2023b). The performance of the online DRL strategy depends strongly on the value of the number of gradient steps and learning rate.…”
Section: Performance Benchmarking Of Online Tl Strategy On Target Bui...mentioning
confidence: 99%
“…Transfer Learning is emerging as a promising strategy to improve the wide-spread application of models. However, there are still significant research gaps regarding the identification of suitable training sources and the prediction of the performance after the transfer to another building [9,10].…”
Section: Motivationmentioning
confidence: 99%