Abstract:Optimal control of thermostatically controlled loads connected to a district heating network is considered a sequential decisionmaking problem under uncertainty. The practicality of a direct model-based approach is compromised by two challenges, namely scalability due to the large dimensionality of the problem and the system identification required to identify an accurate model. To help in mitigating these problems, this paper leverages on recent developments in reinforcement learning in combination with a mar… Show more
“…The disadvantage of using such a learning algorithm is that the range space must be discretized and can lead to high computational effort or low accuracy. A summary of the literature survey of TCL models are described in Table 1-1. Literature Description TCL population Control [4] CT, Linear Homogeneous LQR [5] CT, bi-linear Homogeneous non-linear [18] DT, Linear Heterogeneous MPC [19] DT, 2D,Linear Homogeneous with uncertainties MPC [20] CT, probabilistic model Heterogeneous - [22] DT, non-linear Heterogeneous Q-learning As it can be seen from this survey, the controllers used in the past literature for energy management of TCL are MPC or LQR for a linear model and state/output feedback controllers for the non-linear model. Figure 1-4 represents the control achieved by using an MPC for temperature dependent loads.…”
Section: -1 Related Work -Modeling Of Tclmentioning
confidence: 99%
“…The solutionĉ andŵ can be found by minimizing e i.k in a least squares sense. The equation (4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20)(21)(22) does not depend on the system dynamics but only on the state and input measurements. This brings us to the online adaptive Algorithm-5.…”
“…The disadvantage of using such a learning algorithm is that the range space must be discretized and can lead to high computational effort or low accuracy. A summary of the literature survey of TCL models are described in Table 1-1. Literature Description TCL population Control [4] CT, Linear Homogeneous LQR [5] CT, bi-linear Homogeneous non-linear [18] DT, Linear Heterogeneous MPC [19] DT, 2D,Linear Homogeneous with uncertainties MPC [20] CT, probabilistic model Heterogeneous - [22] DT, non-linear Heterogeneous Q-learning As it can be seen from this survey, the controllers used in the past literature for energy management of TCL are MPC or LQR for a linear model and state/output feedback controllers for the non-linear model. Figure 1-4 represents the control achieved by using an MPC for temperature dependent loads.…”
Section: -1 Related Work -Modeling Of Tclmentioning
confidence: 99%
“…The solutionĉ andŵ can be found by minimizing e i.k in a least squares sense. The equation (4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20)(21)(22) does not depend on the system dynamics but only on the state and input measurements. This brings us to the online adaptive Algorithm-5.…”
“…Liu et al [18] report significant gains by controlling apartments' SH demand via thermostats in a large-scale pilot test. Claessens et al [19] develop a model-free methodology for DHN control which also reflects the recently fast-growing popularity of employing reinforcement learning techniques in energy management research.…”
Section: State-of-the-art Dhn Operation and Optimizationmentioning
Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.
“…Claessens et al control TCLs in a district heating network and obtain a performance within 65% of a theoreticial lower bound on the cost. However, they propose a handcrafted feature extraction method to evaluate what performance can be obtained from this limited state description [13]. More sophisticated feature extraction techniques are proposed by Claessens et al in [14], where a binning of the state space of a cluster of TCLs allows for a convolutional neural network to be used.…”
Section: Dr Control Is a Sequential Decision-making Problem Which Resmentioning
The increasing share of renewable energy sources in the electricity grid results in a higher degree of uncertainty regarding electrical energy production. In response to this, flexibility of the demand has been proposed as part of the solution. An important source of flexibility available at the residential consumer side are thermostatically controlled loads (TCLs). In this paper the activation of this source of flexibility is achieved by applying batch reinforcement learning (BRL) to an electric water heater (EWH) in a Time of Use (ToU) setting. The cost performance of six BRL agents with six different state spaces is compared quantitatively. In every case, the BRL agent can successfully shift energy consumption within 20-25 days. The performance of an agent with access to multiple temperature sensors along the height of the EWH is comparable to the performance of an agent with access to only the highest temperature sensor. This indicates manufacturing costs related to sensors can be reduced while maintaining the same performance. Additionally, results show that the inclusion of a theoretical state of charge value in the state space increases performance by more than 8% compared to the performance of the other BRL agents. It is therefore argued that an estimation of the state of charge should be included in future work as it would increase cost performance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.