Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds

Younes, Alaa; Honerkamp, Daniel; Welschehold, Tim; Valada, Abhinav

doi:10.48550/arxiv.2111.14843

Cited by 2 publications

(2 citation statements)

References 37 publications

“…In PointGoal navigation [22,4], the agent at each step receives the displacement vector to the goal that it has to reach. Whereas, in AudioGoal navigation [6,27], the agent at each step receives an audio signal emitted by a target object. Conversely, in ObjectGoal navigation [29,5,20,10], the agent receives an object category that it has to navigate to.…”

Section: Related Work (mentioning)

confidence: 99%

Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces

Fabian, Honerkamp, Welschehold et al. 2022

Preprint

Self Cite

Recent advances in vision-based navigation and exploration have shown impressive capabilities in photorealistic indoor environments. However, these methods still struggle with long-horizon tasks and require large amounts of data to generalize to unseen environments. In this work, we present a novel reinforcement learning approach for multi-object search that combines short-term and long-term reasoning in a single model while avoiding the complexities arising from hierarchical structures. In contrast to existing multi-object search methods that act in granular discrete action spaces, our approach achieves exceptional performance in continuous action spaces. We perform extensive experiments and show that it generalizes to unseen apartment environments with limited data. Furthermore, we demonstrate zero-shot transfer of the learned policies to an office environment in real-world experiments.

“…While recent progress in control and perception has propelled the capabilities of robotic platforms to autonomously operate in unknown and unstructured environments [1]-[4], this has largely focused on pure navigation tasks [5], [6]. In this work, we focus on autonomous mobile manipulation which combines the difficulties of navigating unstructured, human-centered environments with the complexity of jointly controlling the base and arm.…”

Section: Introduction (mentioning)

confidence: 99%

N$^2$M$^2$: Learning Navigation for Arbitrary Mobile Manipulation Motions in Unseen and Dynamic Environments

Honerkamp, Welschehold, Valada 2022

Preprint

Self Cite

Despite its importance in both industrial and service robotics, mobile manipulation remains a significant challenge as it requires a seamless integration of end-effector trajectory generation with navigation skills as well as reasoning over long horizons. Existing methods struggle to control the large configuration space, and to navigate dynamic and unknown environments. In previous work, we proposed to decompose mobile manipulation tasks into a simplified motion generator for the end-effector in task space and a trained reinforcement learning agent for the mobile base to account for kinematic feasibility of the motion. In this work, we introduce Neural Navigation for Mobile Manipulation (N$^2$M$^2$) which extends this decomposition to complex obstacle environments and enables it to tackle a broad range of tasks in real-world settings. The resulting approach can perform unseen, long-horizon tasks in unexplored environments while instantly reacting to dynamic obstacles and environmental changes. At the same time, it provides a simple way to define new mobile manipulation tasks. We demonstrate the capabilities of our proposed approach in extensive simulation and real-world experiments on multiple kinematically diverse mobile manipulators. Code and videos are publicly available at http://mobile-rl.cs.uni-freiburg.de.
