ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9415013
|View full text |Cite
|
Sign up to set email alerts
|

A Deep Reinforcement Learning Approach To Audio-Based Navigation In A Multi-Speaker Environment

Abstract: In this work we use deep reinforcement learning to create an autonomous agent that can navigate in a two-dimensional space using only raw auditory sensory information from the environment, a problem that has received very little attention in the reinforcement learning literature. Our experiments show that the agent can successfully identify a particular target speaker among a set of N predefined speakers in a room and move itself towards that speaker, while avoiding collision with other speakers or going outsi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…Research studies have also explored the potentials of DRL for solving audio localisation problems. For instance, Giannakopoulos et al (2021) trained an autonomous agent that navigates in a two-dimensional space using audio information from a multi-speaker environment. Based on their results, the authors found that the agent can successfully localise the target speaker among a set of predefined speakers in a room by avoiding confusion and going outside the predefined room boundaries.…”
Section: Other Applicationsmentioning
confidence: 99%
“…Research studies have also explored the potentials of DRL for solving audio localisation problems. For instance, Giannakopoulos et al (2021) trained an autonomous agent that navigates in a two-dimensional space using audio information from a multi-speaker environment. Based on their results, the authors found that the agent can successfully localise the target speaker among a set of predefined speakers in a room by avoiding confusion and going outside the predefined room boundaries.…”
Section: Other Applicationsmentioning
confidence: 99%
“…Their system consists of a sound propagation model and sound localization model based on classical (non-DNN) algorithms. Previous preliminary work by the authors in [5] created an autonomous agent based on deep reinforcement learning, able to navigate a virtual environment posing an audio-based navigation challenge, where a fixed number of speakers are present in a room, and guide itself towards a defined speaker while avoiding colliding with the other speakers, using only raw audio information from the environment. We extend upon this preliminary work by adding another environment focused on the task of audio source localization, unifying the training regime of the autonomous agent across both environments and studying the degree of agent knowledge transfer between the environments.…”
Section: Related Workmentioning
confidence: 99%