2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854189
|View full text |Cite
|
Sign up to set email alerts
|

Underdetermined blind separation and tracking of moving sources based ONDOA-HMM

Abstract: This paper deals with the problem of the underdetermined blind separation and tracking of moving sources. In practical situations, sound sources such as human speakers can move freely and so blind separation algorithms must be designed to track the temporal changes of the impulse responses. We propose solving this problem through the posterior inference of the parameters in a generative model of an observed multichannel signal, formulated under the assumption of the sparsity of time-frequency components of spe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
26
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(26 citation statements)
references
References 11 publications
0
26
0
Order By: Relevance
“…Alternatively, separation of moving sources can be achieved by tracking the spatial position or direction of arrival (DOA) of the sources and using spatial filtering (beamforming or separation mask) for extracting the signal originating from the estimated position or direction in each time instance. In [25] the problem of DOA tracking and separation mask estimation is formulated jointly, however in this paper we consider a two stage approach where the acoustic tracking is done first and the separation masks are estimated in a separate (offline) stage. Also the separation masks are binary in [25] which will lead to compromised subjective separation quality even if oracle masks are used.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Alternatively, separation of moving sources can be achieved by tracking the spatial position or direction of arrival (DOA) of the sources and using spatial filtering (beamforming or separation mask) for extracting the signal originating from the estimated position or direction in each time instance. In [25] the problem of DOA tracking and separation mask estimation is formulated jointly, however in this paper we consider a two stage approach where the acoustic tracking is done first and the separation masks are estimated in a separate (offline) stage. Also the separation masks are binary in [25] which will lead to compromised subjective separation quality even if oracle masks are used.…”
Section: Introductionmentioning
confidence: 99%
“…In [25] the problem of DOA tracking and separation mask estimation is formulated jointly, however in this paper we consider a two stage approach where the acoustic tracking is done first and the separation masks are estimated in a separate (offline) stage. Also the separation masks are binary in [25] which will lead to compromised subjective separation quality even if oracle masks are used.…”
Section: Introductionmentioning
confidence: 99%
“…This method mainly introduced a thresholding function to enforce sparsity. A model was proposed to describe a time-varying array response in the frequency domain for each source [23]. The key of this paper is that it utilizes a hidden Markov model to describe the moving signal and uses the posterior inference to estimate the signal positions.…”
Section: Introductionmentioning
confidence: 99%
“…al. [9,10,11], a framework addressing jointly the two problems seems overlooked in the literature; in [9,10] the active/inactive state of a source is independently modeled with a factorial HMM in a MASS framework. This independent modeling of the activity of a source with respect to the activity of the other sources may be unrealistic for multiparty conversations.…”
Section: Introductionmentioning
confidence: 99%
“…This independent modeling of the activity of a source with respect to the activity of the other sources may be unrealistic for multiparty conversations. In [10] the source activity detection is combined with a direction-of-arrival-dependent HMM for the propagation model. A variational expectation maximization (EM) is presented that infers the sources, their activity and the model parameters, although under the assumption of a single active source per time-frequency bin.…”
Section: Introductionmentioning
confidence: 99%