“…Keepaway is a subtask of RoboCup that was put forth as a testbed for machine learning in 2001 (Stone & Sutton, 2001). It has since been used for research on temporal difference reinforcement learning with function approximation (Stone, Sutton, & Kuhlmann, 2005), evolutionary learning (Pietro et al, 2002), relational reinforcement learning (Walker et al, 2004), behaviour transfer (Cheng et al, 2018;Didi & Nitschke, 2016a, 2016bNitschke & Didi, 2017;Schwab et al, 2018;, batch reinforcement learning (Riedmiller et al, 2009) and hierarchical reinforcement learning (Bai & Russell, 2017).…”