Proceedings of the Eleventh ACM International Conference on Future Energy Systems 2020
DOI: 10.1145/3396851.3397694
|View full text |Cite
|
Sign up to set email alerts
|

MARCO - Multi-Agent Reinforcement learning based COntrol of building HVAC systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(20 citation statements)
references
References 26 publications
1
19
0
Order By: Relevance
“…Comfort is modeled as a setpoint temperature like 23 • C, and the method aims to optimize energy efficiency as long as the indoor temperature is not drifted far from the setpoint. Similar works can also be found in [3]- [6]. However, such empirical assumption on comfort has proven to be inaccurate in many cases, and as a result, the energy saving is often achieved with the significant sacrifice of occupant comfort.…”
mentioning
confidence: 62%
“…Comfort is modeled as a setpoint temperature like 23 • C, and the method aims to optimize energy efficiency as long as the indoor temperature is not drifted far from the setpoint. Similar works can also be found in [3]- [6]. However, such empirical assumption on comfort has proven to be inaccurate in many cases, and as a result, the energy saving is often achieved with the significant sacrifice of occupant comfort.…”
mentioning
confidence: 62%
“…The authors in [45] propose a methods for the optimal scheduling of different household appliances to optimize energy utilization. The authors in [16] propose a MARL algorithm to minimize HVAC energy consumption without sacrificing user comfort by adjusting both the building and chiller set-points. To speed up the training process, they use transfer learning in which the agents are trained on sub-sets of HVAC systems and the learned network weights are used to initialize the multiple agents.…”
Section: B Deep Rl Methods In Bemsmentioning
confidence: 99%
“…In [64], Morinibu et al proposed a A2C-based HVAC control method t uniformity of radiation temperature in the room. In [26], Nagarathinam et al proposed a multi-agent D to minimize HVAC energy consumption without sacrificing user comfort by adjusting both the building an To be specific, each DDQN-based agent coordinate with each other to learn an optimal HVAC control p proposed. For example, Wei et al [15] proposed a DQN-based HVAC control method to save energy cost in office buildings while maintaining the room temperature requirements.…”
Section: Applications Of Drl In a Single Buildingmentioning
confidence: 99%
“…In [64], Morinibu et al proposed a A2C-based HVAC control method to decrease the non-uniformity of radiation temperature in the room. In [26], Nagarathinam et al proposed a multi-agent DRL based algorithm to minimize HVAC energy consumption without sacrificing user comfort by adjusting both the building and chiller set-points. To be specific, each DDQN-based agent coordinate with each other to learn an optimal HVAC control policy.…”
Section: Applications Of Drl In a Single Buildingmentioning
confidence: 99%