“…While many methods extract a symbolic mapping for RL from visual data, e.g. (Lyu et al, 2019;Yang et al, 2018Yang et al, , 2019Lu et al, 2018;Garnelo et al, 2016;Li et al, 2018;Liang & Boularias, 2018;Goel et al, 2018), they all require that all of the reward-relevant features are explicitly represented in the symbolic space. As shown by the many successes of Deep RL, e.g.…”