2022
DOI: 10.48550/arxiv.2205.11384
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces

Abstract: Recent advances in vision-based navigation and exploration have shown impressive capabilities in photorealistic indoor environments. However, these methods still struggle with long-horizon tasks and require large amounts of data to generalize to unseen environments. In this work, we present a novel reinforcement learning approach for multi-object search that combines short-term and long-term reasoning in a single model while avoiding the complexities arising from hierarchical structures. In contrast to existin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…Wixson and Ballard [28] remarks that selecting views for object search in a local region is a harder problem than the selection of which region to search in. Most works that demonstrate robotic search within a search region reduce the problem to 2D [29,6,30,7,31,32,33,34]. For a literature survey and taxonomy of object search, refer to [12].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Wixson and Ballard [28] remarks that selecting views for object search in a local region is a harder problem than the selection of which region to search in. Most works that demonstrate robotic search within a search region reduce the problem to 2D [29,6,30,7,31,32,33,34]. For a literature survey and taxonomy of object search, refer to [12].…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning methods that typically map raw observations to actions [35,36,37,38,34] can enable 3D object search, yet it is hard to train such a model on a robot and ensure generalization to a new real-world environment; ongoing work (e.g., [34]) is addressing this challenge. In contrast, GenMOS only requires basic perception capabilities such as object detection and localization to enable object search.…”
Section: Related Workmentioning
confidence: 99%
“…However, they are computationally expensive, often do not optimize past a limited horizon, and can struggle when objectives oppose each other. Learning-based methods efficiently learn directly from high-dimensional observations and are well suited to handle unexplored environments [14]- [16]. Nevertheless, they either restrict the action space [14], [17] or rely on expert demonstrations [15] to cope with the highdimensional action space and long-horizon nature of mobile manipulation.…”
Section: Introductionmentioning
confidence: 99%