“…Other recent studies raise the concern about the dynamic variation of metrics used for allocation policies, such as execution time or CPU utilization, which can vary significantly over time or on different nodes of the continuum, and, for this reason, they employ ML-based solutions to learn workload patterns [11]. In the computing continuum environment proposed in [12], each edge cluster contains a scheduler node in charge of receiving requests from clients and of taking decisions whether to execute the task locally, on the cloud or to reject it; the decision is based on a Reinforcement Learning (RL) engine which receives a positive reward for every task completed within a deadline. All the previous studies assume a global IoT workload balance.…”