2016
DOI: 10.1109/taslp.2016.2555085
|View full text |Cite
|
Sign up to set email alerts
|

Semi-Supervised Sound Source Localization Based on Manifold Regularization

Abstract: Conventional speaker localization algorithms, based merely on the received microphone signals, are often sensitive to adverse conditions, such as: high reverberation or low signal to noise ratio (SNR). In some scenarios, e.g. in meeting rooms or cars, it can be assumed that the source position is confined to a predefined area, and the acoustic parameters of the environment are approximately fixed. Such scenarios give rise to the assumption that the acoustic samples from the region of interest have a distinct g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
33
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 67 publications
(34 citation statements)
references
References 46 publications
0
33
0
1
Order By: Relevance
“…In a supervised learning approach, the mapping between the binaural cues and the source locations is learnt from annotated data using a probabilistic piecewise affine regression model. A semi-supervised approach is proposed in [116] that uses RTF values input features in order to learn the source locations based on manifold regularization.…”
Section: ) Subspace Techniquesmentioning
confidence: 99%
“…In a supervised learning approach, the mapping between the binaural cues and the source locations is learnt from annotated data using a probabilistic piecewise affine regression model. A semi-supervised approach is proposed in [116] that uses RTF values input features in order to learn the source locations based on manifold regularization.…”
Section: ) Subspace Techniquesmentioning
confidence: 99%
“…One difficulty for ML-based methods in acoustics is the limited amount of labeled data and the complex acoustic propagation in natural environments, despite large volumes of recordings [1], [2]. This limitation has motivated recent approaches for localization based on semi-supervised learning (SSL) [15], [16].…”
Section: Introductionmentioning
confidence: 99%
“…Another classical SSL method, steered response power with phase transform (SRP-PHAT) [9][10][11] has been shown to not be robust to non-stationary signal like speech. Recently, SSL approaches based on deep neural networks (DNNs) have been proposed [13][14][15][16][17][18][19][20]. Most of the approaches are based on supervised learning.…”
Section: Introductionmentioning
confidence: 99%