“…The numerous successful applications of reinforcement learning include (in no particular order) learning in games (e.g., Backgammon (Tesauro, 1994) and Go (Silver et al, 2007)), applications in networking (e.g., packet routing (Boyan and Littman, 1994), channel allocation (Singh and Bertsekas, 1997)), applications to operations research problems (e.g., targeted marketing (Abe et al, 2004), maintenance problems (Gosavi, 2004), job-shop scheduling (Zhang and Dietterich, 1995), elevator control (Crites and Barto, 1996), pricing (Rusmevichientong et al, 2006), vehicle routing (Proper and Tadepalli, 2006), inventory control (Chang et al, 2007), fleet management (Simão et al, 2009)), learning in robotics (e.g., controlling quadrupedales (Kohl and Stone, 2004), humanoid robots (Peters et al, 2003), or helicopters (Abbeel et al, 2007)), and applications to finance (e.g., option pricing (Tsitsiklis andVan Roy, 1999b, 2001;Yu and Bertsekas, 2007;Li et al, 2009) …”