Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning

Huang, De-An; Xu, Danfei; Zhu, Yuke; Garg, Animesh; Savarese, Silvio; Li, Feifei; Niebles, Juan Carlos

doi:10.1109/iros40897.2019.8967761

Cited by 24 publications

(18 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For visual inputs, [11] and [21] experimented with 3 separate settings: simulated planer reaching (with different target object colors), simulated planer pushing (with varying target object locations), and real-robot, object-in-hand placing (onto different target containers); [45] set up a two-stage pick-then-place task with varying target objects and target containers; [7] uses a simulated Pick & Place task with 4 objects to pick and 4 target bins to place (hence 16 variations in total). The AI2-THOR [22] environment used in [19] requires collecting varying objects and dropping off at their designated receptacles, where actions are purely semantic concepts such as "dropoff" or "search". In contrast, in this work we consider a harder, multi-task setup, where agent needs to perform well across more diverse and distinct tasks, and generalize not only to new instances of all the seen variations, but also to completely novel tasks.…”

Section: F Further Discussion On Related Workmentioning

confidence: 99%

“…Later work extended OSIL to observe visual inputs: [11] applies the Model-Agnostic Meta-Learning algorithm (MAML) [10] to adapt policy model parameters for new tasks; TecNets [21] applies a hinge rank loss to learn explicit task embeddings; DAML [45] adds a domain-adaptation objective to MAML to use human demonstration videos; [7] improves policy network with Transformer architecture [41]. Another line of work learns modular task structures that can be reused at test time [43] [18] [19], but outputs of these symbolic policies are highly abstracted into semantic action concepts (e.g. "pick", "release") that assume extensive domain knowledge and human-designed priors.…”

Section: Related Work Imitation Learningmentioning

confidence: 99%

See 1 more Smart Citation

Towards More Generalizable One-shot Visual Imitation Learning

Zhao¹,

Liu²,

Lee³

et al. 2021

Preprint

View full text Add to dashboard Cite

A general-purpose robot should be able to master a wide range of tasks and quickly learn a novel one by leveraging past experiences. One-shot imitation learning (OSIL) approaches this goal by training an agent with (pairs of) expert demonstrations, such that at test time, it can directly execute a new task from just one demonstration. However, so far this framework has been limited to training on many variations of one task, and testing on other unseen but similar variations of the same task. In this work, we push for a higher level of generalization ability by investigating a more ambitious multi-task setup. We introduce a diverse suite of vision-based robot manipulation tasks, consisting of 7 tasks, a total of 61 variations, and a continuum of instances within each variation. For consistency and comparison purposes, we first train and evaluate single-task agents (as done in prior few-shot imitation work). We then study the multi-task setting, where multi-task training is followed by (i) one-shot imitation on variations within the training tasks, (ii) one-shot imitation on new tasks, and (iii) fine-tuning on new tasks. Prior state-of-theart, while performing well within some single tasks, struggles in these harder multi-task settings. To address these limitations, we propose MOSAIC (Multi-task One-Shot Imitation with self-Attention and Contrastive learning), which integrates a self-attention model architecture and a temporal contrastive module to enable better task disambiguation and more robust representation learning. Our experiments show that MOSAIC outperforms prior state of the art in learning efficiency, final performance, and learns a multi-task policy with promising generalization ability via fine-tuning on novel tasks.

show abstract

Section: F Further Discussion On Related Workmentioning

confidence: 99%

Section: Related Work Imitation Learningmentioning

confidence: 99%

Towards More Generalizable One-shot Visual Imitation Learning

Zhao¹,

Liu²,

Lee³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…The effort required to define new actions makes scalability an issue. Future work includes using learning-based approaches to alleviate the engineering bottlenecks, such as learning the preconditions and postconditions of the symbolic actions, as is considered by Huang et al [32], or learning the constraint functions from demonstration.…”

Section: Discussionmentioning

confidence: 99%

Object-Centric Task and Motion Planning in Dynamic Environments

Migimatsu

Bohg

2020

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

We address the problem of applying Task and Motion Planning (TAMP) in real world environments. TAMP combines symbolic and geometric reasoning to produce sequential manipulation plans, typically specified as joint-space trajectories, which are valid only as long as the environment is static and perception and control are highly accurate. In case of any changes in the environment, slow re-planning is required. We propose a TAMP algorithm that optimizes over Cartesian frames defined relative to target objects. The resulting plan then remains valid even if the objects are moving and can be executed by reactive controllers that adapt to these changes in real time. We apply our TAMP framework to a torque-controlled robot in a pick and place setting and demonstrate its ability to adapt to changing environments, inaccurate perception, and imprecise control, both in simulation and the real world.

show abstract

“…Given these challenges, its natural to examine the use of learning to improve task and motion planning with real sensing [4,5,6,7,8,9]. However, previous methods fail to solve the full problem of unknown object rearrangement with physical robots.…”

Section: Introductionmentioning

confidence: 99%

“…However, previous methods fail to solve the full problem of unknown object rearrangement with physical robots. Some only operate on known objects [6,9], others ignore or significantly restrict the space of robot control [4,5] or relations [7,8], while still others make assume an explicit goal configuration is given [5]. An alternative approach to solve complex manipulation tasks relies on learning model-free neural net policies instead of explicit models of conditions and effects [10,11].…”

Section: Introductionmentioning

confidence: 99%

Predicting Stable Configurations for Semantic Placement of Novel Objects

Paxton,

Xie,

Hermans

et al. 2021

Preprint

View full text Add to dashboard Cite

Human environments contain numerous objects configured in a variety of arrangements. Our goal is to enable robots to repose previously unseen objects according to learned semantic relationships in novel environments. We break this problem down into two parts: (1) finding physically valid locations for the objects and (2) determining if those poses satisfy learned, high-level semantic relationships. We build our models and training from the ground up to be tightly integrated with our proposed planning algorithm for semantic placement of unknown objects. We train our models purely in simulation, with no fine-tuning needed for use in the real world. Our approach enables motion planning for semantic rearrangement of unknown objects in scenes with varying geometry from only RGB-D sensing. Our experiments through a set of simulated ablations demonstrate that using a relational classifier alone is not sufficient for reliable planning. We further demonstrate the ability of our planner to generate and execute diverse manipulation plans through a set of real-world experiments with a variety of objects.

show abstract

Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning

Cited by 24 publications

References 20 publications

Towards More Generalizable One-shot Visual Imitation Learning

Towards More Generalizable One-shot Visual Imitation Learning

Object-Centric Task and Motion Planning in Dynamic Environments

Predicting Stable Configurations for Semantic Placement of Novel Objects

Contact Info

Product

Resources

About