2022
DOI: 10.48550/arxiv.2202.01426
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Interleaving Monte Carlo Tree Search and Self-Supervised Learning for Object Retrieval in Clutter

Abstract: In this study, working with the task of object retrieval in clutter, we have developed a robot learning framework in which Monte Carlo Tree Search (MCTS) is first applied to enable a Deep Neural Network (DNN) to learn the intricate interactions between a robot arm and a complex scene containing many objects, allowing the DNN to partially clone the behavior of MCTS. In turn, the trained DNN is integrated into MCTS to help guide its search effort. We call this approach Monte Carlo tree search and learning for Ob… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 31 publications
0
1
0
Order By: Relevance
“…Outside of symbolic planning, Huang et al [26] learned to retrieving a target object from clutter performing a Monte Carlo search over highlevel non-prehensile actions. Its search time was subsequently improved by learning to predict the discounted reward of each branch in MCTS without the need to roll out [27]. Bai et al [28] also tackle non-prehensile manipulation by proposing an MCTS algorithm guided by a policy network which is trained by imitation and reinforcement.…”
Section: Related Workmentioning
confidence: 99%
“…Outside of symbolic planning, Huang et al [26] learned to retrieving a target object from clutter performing a Monte Carlo search over highlevel non-prehensile actions. Its search time was subsequently improved by learning to predict the discounted reward of each branch in MCTS without the need to roll out [27]. Bai et al [28] also tackle non-prehensile manipulation by proposing an MCTS algorithm guided by a policy network which is trained by imitation and reinforcement.…”
Section: Related Workmentioning
confidence: 99%