Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation

Zhang, Chi; Kuppannagari, Sanmukh R.; Kannan, Rajgopal; Prasanna, Viktor K.

doi:10.1145/3360322.3360861

Cited by 84 publications

(67 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this list, two papers only focus on outdoor air quality [64,97], eight papers do not include any AI-specific prediction algorithms [22,31,47,50,84,91,116,122] or were based on some mathematical approaches. Three papers [12,96,137] only focused on thermal comfort (temperature and/or humidity data) or other smart building aspects instead of air quality. Moreover, two studies [89,120] were rejected because they were limited to a monitoring system design and no prediction system was implemented.…”

Section: Study Selectionmentioning

confidence: 99%

Indoor air quality prediction systems for smart environments: A systematic review

Saini

Dutta

Marques

2020

AIS

View full text Add to dashboard Cite

Air quality is a critical matter of concern in terms of the impact on public health and well-being. Although the consequences of poor air quality are more severe in developing countries, they also have a critical impact in developed countries. Healthcare costs due to air pollution reach $150 billion in the USA, whereas particulate matter causes 412,000 premature deaths in Europe, every year. According to the Environmental Protection Agency (EPA), indoor air pollutant levels can be up to 100 times higher in comparison to outdoor air quality. Indoor air quality (IAQ) is in the top five environmental risks to global health and well-being. The research community explored the scope of artificial intelligence (AI) in the past years to deal with this problem. The IAQ prediction systems contribute to smart environments where advanced sensing technologies can create healthy living conditions for building occupants. This paper reviews the applications and potential of AI for the prediction of IAQ to enhance building environment and public health. The results show that most of the studies analyzed incorporate neural networks-based models and the preferred evaluation metrics are RMSE, R 2 score and error rate. Furthermore, 66.6% of the studies include CO2 sensors for IAQ assessment. Temperature and humidity parameters are also included in 90.47% and 85.71% of the proposed methods, respectively. This study also presents some limitations of the current research activities associated with the evaluation of the impact of different pollutants based on different geographical conditions and living environments. Moreover, the use of reliable and calibrated sensor networks for real-time data collection is also a significant challenge.

show abstract

Section: Study Selectionmentioning

confidence: 99%

Indoor air quality prediction systems for smart environments: A systematic review

Saini

Dutta

Marques

2020

AIS

View full text Add to dashboard Cite

show abstract

“…A holistic DRL method for the energy management of commercial buildings was presented in [30] where Heating, Ventilation, and Air conditioning (HVAC) system, lighting, blind, and window systems are controlled to achieve energy savings within the buildings' occupants comfort in terms of thermal, air quality, and illumination conditions. To resolve the limit of model-free DRL methods such as low sample efficiency, a model-based RL method was developed for building HVAC control that trains the system dynamics using neural networks [31]. Based on the trained system dynamics, the operation of the HVAC system was managed by model predictive control to minimize both the energy cost and the indoor temperature constraints violation.…”

Section: Introductionmentioning

confidence: 99%

Energy Management of Smart Home with Home Appliances, Energy Storage System and Electric Vehicle: A Hierarchical Deep Reinforcement Learning Approach

Lee

Choi

2020

Sensors

View full text Add to dashboard Cite

This paper presents a hierarchical deep reinforcement learning (DRL) method for the scheduling of energy consumptions of smart home appliances and distributed energy resources (DERs) including an energy storage system (ESS) and an electric vehicle (EV). Compared to Q-learning algorithms based on a discrete action space, the novelty of the proposed approach is that the energy consumptions of home appliances and DERs are scheduled in a continuous action space using an actor–critic-based DRL method. To this end, a two-level DRL framework is proposed where home appliances are scheduled at the first level according to the consumer’s preferred appliance scheduling and comfort level, while the charging and discharging schedules of ESS and EV are calculated at the second level using the optimal solution from the first level along with the consumer environmental characteristics. A simulation study is performed in a single home with an air conditioner, a washing machine, a rooftop solar photovoltaic system, an ESS, and an EV under a time-of-use pricing. Numerical examples under different weather conditions, weekday/weekend, and driving patterns of the EV confirm the effectiveness of the proposed approach in terms of total cost of electricity, state of energy of the ESS and EV, and consumer preference.

show abstract

“…Although a simulated building model can be used to accelerate the training process, it needs a highdelity model, which is hard to calibrate [6,7]. Recently, Model-Based Reinforcement Learning (MBRL) has been tested for HVAC control to achieve high data eciency [10]. The HVAC system BuildSys '20, November 18-20, 2020, Virtual Event, Japan Xianzhong Ding, Wan Du, and Alberto E. Cerpa dynamics is rst learned using a neural network based on historical HVAC data.…”

Section: Introductionmentioning

confidence: 99%

“…The HVAC system BuildSys '20, November 18-20, 2020, Virtual Event, Japan Xianzhong Ding, Wan Du, and Alberto E. Cerpa dynamics is rst learned using a neural network based on historical HVAC data. Based on the learned building dynamics model, an MPC controller tries to nd the optimal control action by using a Random Shooting (RS) method [10]. For controlling a single-zone HVAC system, an MBRL-based approach saves approximately 10⇥ training time of the MFRL approach, while achieving comparable performance [10].…”

Section: Introductionmentioning

confidence: 99%

“…Based on the learned building dynamics model, an MPC controller tries to nd the optimal control action by using a Random Shooting (RS) method [10]. For controlling a single-zone HVAC system, an MBRL-based approach saves approximately 10⇥ training time of the MFRL approach, while achieving comparable performance [10]. However, most of the commercial buildings are multi-zone buildings [11].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Mb2c

Ding

Cerpa

2020

Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

View full text Add to dashboard Cite

Reinforcement learning has been widely studied for controlling Heating, Ventilation, and Air conditioning (HVAC) systems. Most of the existing works are focused on Model-Free Reinforcement Learning (MFRL), which learns an agent by extensively trial-and-error interaction with a real building. However, one of the fundamental problems with MFRL is the very large amount of training data required to converge to acceptable performance. Although simulation models have been used to generate sucient training data to accelerate the training process, MFRL needs a high-delity building model for simulation, which is also hard to calibrate. As a result, Model-Based Reinforcement Learning (MBRL) has been used for HVAC control. While MBRL schemes can achieve excellent sample eciency (i.e. less training data), they often lag behind model-free approaches in terms of asymptotic control performance (i.e. high energy savings while meeting occupants' thermal comfort). In this paper, we conduct a set of experiments to analyze the limitations of current MBRL-based HVAC control methods, in terms of model uncertainty and controller eectiveness. Using the lessons learned, we develop MB 2 C, a novel MBRL-based HVAC control system that can achieve high control performance with excellent sample eciency. MB 2 C learns the building dynamics by employing an ensemble of environment-conditioned neural networks. It then applies a new control method, Model Predictive Path Integral (MPPI), for HVAC control. It produces candidate action sequences by using an importance sampling weighted algorithm that scales better to high state and action dimensions of multi-zone buildings. We evaluate MB 2 C using EnergyPlus simulations in a ve-zone oce building. The results show that MB 2 C can achieve 8.23% more energy savings compared to the state-of-the-art MBRL solution while maintaining similar thermal comfort. MB 2 C can reduce the training data set by an order of magnitude (10.52⇥) while achieving comparable performance to MFRL approaches. CCS CONCEPTS • Computing methodologies → Control methods.

show abstract

Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation

Cited by 84 publications

References 25 publications

Indoor air quality prediction systems for smart environments: A systematic review

Indoor air quality prediction systems for smart environments: A systematic review

Energy Management of Smart Home with Home Appliances, Energy Storage System and Electric Vehicle: A Hierarchical Deep Reinforcement Learning Approach

Mb2c

Contact Info

Product

Resources

About