Interleaving Monte Carlo Tree Search and Self-Supervised Learning for Object Retrieval in Clutter

Huang, Baichuan; Boularias, Abdeslam; Yu, Jingjin

doi:10.48550/arxiv.2202.01426

Cited by 1 publication

(1 citation statement)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Outside of symbolic planning, Huang et al [26] learned to retrieving a target object from clutter performing a Monte Carlo search over highlevel non-prehensile actions. Its search time was subsequently improved by learning to predict the discounted reward of each branch in MCTS without the need to roll out [27]. Bai et al [28] also tackle non-prehensile manipulation by proposing an MCTS algorithm guided by a policy network which is trained by imitation and reinforcement.…”

Section: Related Workmentioning

confidence: 99%

Coarse-to-fine Q-attention with Tree Expansion

James¹,

Abbeel²

2022

Preprint

View full text Add to dashboard Cite

Coarse-to-fine Q-attention enables sample-efficient robot manipulation by discretizing the translation space in a coarse-to-fine manner, where the resolution gradually increases at each layer in the hierarchy. Although effective, Q-attention suffers from "coarse ambiguity" -when voxelization is significantly coarse, it is not feasible to distinguish similar-looking objects without first inspecting at a finer resolution. To combat this, we propose to envision Q-attention as a tree that can be expanded and used to accumulate value estimates across the top-k voxels at each Q-attention depth. When our extension, Q-attention with Tree Expansion (QTE), replaces standard Qattention in the Attention-driven Robot Manipulation (ARM) system, we are able to accomplish a larger set of tasks; especially on those that suffer from "coarse ambiguity". In addition to evaluating our approach across 12 RLBench tasks, we also show that the improved performance is visible in a real-world task involving small objects. Videos and code found at: https: //sites.google.com/view/q-attention-qte.

show abstract

Section: Related Workmentioning

confidence: 99%