Weipeng He scite author profile

We propose to use neural networks for simultaneous detection and localization of multiple sound sources in human-robot interaction. In contrast to conventional signal processing techniques, neural network-based sound source localization methods require fewer strong assumptions about the environment. Previous neural network-based methods have been focusing on localizing a single sound source, which do not extend to multiple sources in terms of detection and localization.In this paper, we thus propose a likelihood-based encoding of the network output, which naturally allows the detection of an arbitrary number of sources. In addition, we investigate the use of sub-band cross-correlation information as features for better localization in sound mixtures, as well as three different network architectures based on different motivations. Experiments on real data recorded from a robot show that our proposed methods significantly outperform the popular spatial spectrum-based approaches.

show abstract

Characteristic analysis on temporal evolution of floc size and structure in low-shear flow

Nan

et al. 2012

Water Research

118

View full text Add to dashboard Cite

Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation

Motlíček

Odobez

2021

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-adversarial Training

Motlíček

Odobez

2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Weipeng He

Deep Neural Networks for Multiple Speaker Detection and Localization

Characteristic analysis on temporal evolution of floc size and structure in low-shear flow

Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-adversarial Training

Contact Info

Product

Resources

About