2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7951789
|View full text |Cite
|
Sign up to set email alerts
|

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures

Abstract: We present a probabilistic model for joint source separation and diarisation of multichannel convolutive speech mixtures. We build upon the framework of local Gaussian model (LGM) with non-negative matrix factorization (NMF). The diarisation is introduced as a temporal labeling of each source in the mix as active or inactive at the short-term frame level. We devise an EM algorithm in which the source separation process is aided by the diarisation state, since the latter indicates the sources actually present i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
18
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
1
1

Relationship

3
2

Authors

Journals

citations
Cited by 12 publications
(18 citation statements)
references
References 20 publications
0
18
0
Order By: Relevance
“…In the present paper we extend the work in [14] by proposing a probabilistic model for simultaneous MASS and diarization of under-determined convolutive speech mixtures, that does not use the narrow-band assumption but is based instead on the spatial covariance matrix model (SCM) [1,16]. The source activity model we use here is a variant of [14].…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…In the present paper we extend the work in [14] by proposing a probabilistic model for simultaneous MASS and diarization of under-determined convolutive speech mixtures, that does not use the narrow-band assumption but is based instead on the spatial covariance matrix model (SCM) [1,16]. The source activity model we use here is a variant of [14].…”
Section: Introductionmentioning
confidence: 99%
“…A categorical random variable Z = n, n ∈ [1, N ] selects the diarization dZ at frame . Z is modelled by an HMM as in [14]:…”
Section: Source Activity Modelmentioning
confidence: 99%
See 3 more Smart Citations