2018
DOI: 10.1016/j.enbuild.2017.08.052
|View full text |Cite
|
Sign up to set email alerts
|

Model-free control of thermostatically controlled loads connected to a district heating network

Abstract: Optimal control of thermostatically controlled loads connected to a district heating network is considered a sequential decisionmaking problem under uncertainty. The practicality of a direct model-based approach is compromised by two challenges, namely scalability due to the large dimensionality of the problem and the system identification required to identify an accurate model. To help in mitigating these problems, this paper leverages on recent developments in reinforcement learning in combination with a mar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 59 publications
(21 citation statements)
references
References 49 publications
0
21
0
Order By: Relevance
“…The disadvantage of using such a learning algorithm is that the range space must be discretized and can lead to high computational effort or low accuracy. A summary of the literature survey of TCL models are described in Table 1-1. Literature Description TCL population Control [4] CT, Linear Homogeneous LQR [5] CT, bi-linear Homogeneous non-linear [18] DT, Linear Heterogeneous MPC [19] DT, 2D,Linear Homogeneous with uncertainties MPC [20] CT, probabilistic model Heterogeneous - [22] DT, non-linear Heterogeneous Q-learning As it can be seen from this survey, the controllers used in the past literature for energy management of TCL are MPC or LQR for a linear model and state/output feedback controllers for the non-linear model. Figure 1-4 represents the control achieved by using an MPC for temperature dependent loads.…”
Section: -1 Related Work -Modeling Of Tclmentioning
confidence: 99%
See 1 more Smart Citation
“…The disadvantage of using such a learning algorithm is that the range space must be discretized and can lead to high computational effort or low accuracy. A summary of the literature survey of TCL models are described in Table 1-1. Literature Description TCL population Control [4] CT, Linear Homogeneous LQR [5] CT, bi-linear Homogeneous non-linear [18] DT, Linear Heterogeneous MPC [19] DT, 2D,Linear Homogeneous with uncertainties MPC [20] CT, probabilistic model Heterogeneous - [22] DT, non-linear Heterogeneous Q-learning As it can be seen from this survey, the controllers used in the past literature for energy management of TCL are MPC or LQR for a linear model and state/output feedback controllers for the non-linear model. Figure 1-4 represents the control achieved by using an MPC for temperature dependent loads.…”
Section: -1 Related Work -Modeling Of Tclmentioning
confidence: 99%
“…The solutionĉ andŵ can be found by minimizing e i.k in a least squares sense. The equation (4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20)(21)(22) does not depend on the system dynamics but only on the state and input measurements. This brings us to the online adaptive Algorithm-5.…”
Section: -4-2 Non-linear Off-policy Optimal Adaptive Algorithmmentioning
confidence: 99%
“…Liu et al [18] report significant gains by controlling apartments' SH demand via thermostats in a large-scale pilot test. Claessens et al [19] develop a model-free methodology for DHN control which also reflects the recently fast-growing popularity of employing reinforcement learning techniques in energy management research.…”
Section: State-of-the-art Dhn Operation and Optimizationmentioning
confidence: 99%
“…Claessens et al control TCLs in a district heating network and obtain a performance within 65% of a theoreticial lower bound on the cost. However, they propose a handcrafted feature extraction method to evaluate what performance can be obtained from this limited state description [13]. More sophisticated feature extraction techniques are proposed by Claessens et al in [14], where a binning of the state space of a cluster of TCLs allows for a convolutional neural network to be used.…”
Section: Dr Control Is a Sequential Decision-making Problem Which Resmentioning
confidence: 99%