2021
DOI: 10.1109/access.2021.3114161
|View full text |Cite
|
Sign up to set email alerts
|

Research and Application of Predictive Control Method Based on Deep Reinforcement Learning for HVAC Systems

Abstract: Energy efficiency and consumption control remain a significant topic in the area of Heating, Ventilation, and Air Conditioning (HVAC) systems. Deep reinforcement learning (DRL) is an emerging technique to optimize energy consumption. Its advantage lies in the ability to tackle the time-series nature of energy data and complexity brought by environmental factors. However, most DRL algorithms have not considered both time-of-use electricity pricing and thermal comfort. This paper proposed a hybrid approach based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…The results showed that the predictive control method could ensure thermal comfort and air quality in the indoor environment while minimizing energy consumption. Fu et al [35] established a thermal dynamics model to predict the future trend of HVAC systems. Twin delayed deep deterministic policy gradient algorithm and model predictive control (TD3-MPC) were proposed to pre-adjust building temperatures at off-peak times.…”
Section: Predictive Controlmentioning
confidence: 99%
“…The results showed that the predictive control method could ensure thermal comfort and air quality in the indoor environment while minimizing energy consumption. Fu et al [35] established a thermal dynamics model to predict the future trend of HVAC systems. Twin delayed deep deterministic policy gradient algorithm and model predictive control (TD3-MPC) were proposed to pre-adjust building temperatures at off-peak times.…”
Section: Predictive Controlmentioning
confidence: 99%
“…In such cases, AC methods were utilized to tune controller parameters, [57,58] or identify the process dynamics [59] and help generate control actions. [60][61][62][63] Overall, AC methods have been found very suitable for set-point tracking and disturbance rejection in real-time implementations. Table 1 summarizes important studies, which utilized AC methods for process control.…”
Section: Process Control Applicationsmentioning
confidence: 99%
“…As to the use of application software, approximately 60% of the studies, utilized MATLAB software for controller development, integration, or real-time implementation. [60,63,69] Other researchers mostly used Tensorflow [64] and Python. [83] Some studies have integrated different functionalities of more than one software applications, for example, (i) MATLAB's process simulation module with Python's ANN generator APIs-PyTorch and Keras [70] and (ii) hybrid training module of Tensorflow and Python's Lambda deep learning workstation.…”
Section: Process Control Applicationsmentioning
confidence: 99%
“…However, to ensure an accurate simulation, some authors run the simulator with a shorter timestep, e.g., 5 min, in which case the simulator takes several steps adding up to 15 min before the next interaction with the RL agent is performed. Some works use longer control timesteps such as 20 min [78] or 60 min [79]. Depending on how the environment has been implemented, and how long it takes for the system to stabilize after a control action, a shorter control time step could result in unstable feedback to the RL agent [80].…”
Section: Overview Of the Analyzed Articlesmentioning
confidence: 99%