“…The application of reinforcement learning (RL) models has been on a rise following the onset of deep reinforcement learning [1]. The ability of RL models to solve complex problems, represented mostly by the association of high-dimensional states and a large number of discrete or continuous actions, has recently led to the development of expert systems for guiding autonomous cars [2], [3], predicting the stock exchange impact [4], [5], and coordinating a swarm of robots to protect the environment [6], [7], to name a few examples.…”