2020
DOI: 10.48550/arxiv.2012.13037
Preprint

SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning

Abstract: Symbolic planning models allow decision-making agents to sequence actions in arbitrary ways to achieve a variety of goals in dynamic domains. However, they are typically handcrafted and tend to require precise formulations that are not robust to human error. Reinforcement learning (RL) approaches do not require such models, and instead learn domain dynamics by exploring the environment and collecting rewards. However, RL approaches tend to require millions of episodes of experience and often learn policies tha…

Cited by 3 publications (5 citation statements)
References 20 publications
“…For example, previous work has considered learning propositional [Zhang et al, 2018, Dittadi et al, 2020, Tsividis, 2019] or lifted [Arora et al, 2018, Asai and Fukunaga, 2018, Asai, 2019, Asai and Muise, 2020, Ames et al, 2018, Ahmetoglu et al, 2020] symbolic transition models, and using them with AI planners [Hoffmann, 2001, Helmert, 2006]. Other related work has used symbolic planners as managers in hierarchical RL, where low-level option policies are learned [Lyu et al, 2019, Sarathy et al, 2020, Gordon et al, 2019, Illanes et al, 2020, Yang et al, 2018, Kokel et al, 2021]. In contrast to all these, we are focused on robotic settings where the planner must handle geometric considerations in addition to the symbolic ones.…”
Section: Related Work
confidence: 99%
“…The process of collecting experiences for learning is time-consuming and the sample efficiency is low. To alleviate those issues, researchers have investigated the combination of HRL and symbolic planning to improve transferability, interpretability, and data efficiency (Ryan 2002; Leonetti, Iocchi, and Stone 2016; Yang et al 2018; Lyu et al 2019; Illanes et al 2020; Sarathy et al 2020; Lee et al 2021). In those works, the original MDP is divided into two levels.…”
Section: Introduction
confidence: 99%
“…A more realistic idea is to automatically learn action models from training data (Zhuo and Kambhampati 2017; Yang, Wu, and Jiang 2007; Ng and Petrick 2019; Martínez et al 2016; James, Rosman, and Konidaris 2020) and exploit the learnt action models to generate plans for guiding the exploration of options. Although there is indeed an approach (Sarathy et al 2020) proposed to learn action models automatically, they still need to manually define major parts of action models in advance. Besides, the planning goal in this approach is kept unchanged while in our framework it is dynamically adapted to maximize the external reward.…”
Section: Introduction
confidence: 99%
“…To alleviate those issues, researchers have investigated the combination of h-DRL and symbolic planning to improve transferability, interpretability, and data efficiency (Parr and Russell 1997; Ryan 2002; Hogg, Kuter, and Muñoz-Avila 2010; Leonetti, Iocchi, and Stone 2016; Yang et al 2018; Lyu et al 2019; Illanes et al 2020; Sarathy et al 2020). In this structure, the original MDP is divided into two levels.…”
Section: Introduction
confidence: 99%
“…A more realistic idea is to automatically learn action models from training data (Zhuo and Kambhampati 2017; Yang, Wu, and Jiang 2007; Ng and Petrick 2019; Martínez et al 2016; James, Rosman, and Konidaris 2020) and exploit the learnt action models to generate plans for guiding the exploration of options. Although there is indeed an approach (Sarathy et al 2020) proposed to learn action models automatically, they still need to manually define a major part of the models in advance. Besides, the planning goal in this approach is kept unchanged while it is dynamically adapted to maximize the external reward in our framework.…”
Section: Introduction
confidence: 99%
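
Several of the citation statements above describe the same two-level scheme: a symbolic planner acts as a high-level manager that proposes subgoals, and low-level option policies learned with RL execute primitive actions until each subgoal is satisfied. The sketch below is a minimal illustration of that division under assumed interfaces; it is not code from SPOTTER or any of the citing papers, and the planner, option policy, and termination check are hypothetical placeholders.

```python
import random

def symbolic_plan(symbolic_state, goal):
    # Hypothetical planner output: an ordered list of subgoals. In the works
    # cited above this would come from an AI planner run over a (possibly
    # learned) symbolic action model.
    return ["get_key", "open_door", goal]

def option_policy(subgoal, observation):
    # Stand-in for an RL-learned option policy for one subgoal; here it just
    # picks a random primitive action.
    return random.choice(["left", "right", "forward", "interact"])

def subgoal_achieved(subgoal, observation):
    # Placeholder termination check; normally evaluated on symbolic predicates.
    return random.random() < 0.2

def run_episode(goal="reach_goal", max_steps=100):
    observation = {}                 # toy observation; a real env would supply this
    symbolic_state = {"at": "start"}
    steps = 0
    for subgoal in symbolic_plan(symbolic_state, goal):    # high level: plan
        while not subgoal_achieved(subgoal, observation) and steps < max_steps:
            action = option_policy(subgoal, observation)   # low level: act
            steps += 1               # a real env.step(action) would update observation
    return steps

if __name__ == "__main__":
    print("episode finished after", run_episode(), "low-level steps")
```

In a full system of the kind the citing papers discuss, symbolic_plan would invoke a planner over symbolic operators, each option_policy would be trained with RL, and subgoal_achieved would test the corresponding symbolic predicate on the environment state.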