2007
DOI: 10.1007/978-1-4020-6479-1_9
|View full text |Cite
|
Sign up to set email alerts
|

K-means Based Underdetermined Blind Speech Separation

Abstract: Abstract. This chapter addresses a blind sparse source separation method that can employ arbitrarily arranged multiple microphones. Some sparse source separation methods, which rely on source sparseness and an anechoic mixing model, have already been proposed. The validity of the sparseness and anechoic assumptions will be investigated in this chapter. As most of the existing methods utilize a stereo (two sensors) system, they limit the separation ability to a 2-dimensional half-plane. This chapter describes a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
19
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(21 citation statements)
references
References 25 publications
2
19
0
Order By: Relevance
“…To estimate such TF points, a spatial feature vector is calculated from the STFT representations of the M observations. Previous researches [14,15] have identified level ratios and phase differences between the observations as appropriate features, as such features retain information on the magnitude and the argument of the TF points. Further discussion is presented in section 4.3.1.…”
Section: Feature Extractionmentioning
confidence: 99%
See 2 more Smart Citations
“…To estimate such TF points, a spatial feature vector is calculated from the STFT representations of the M observations. Previous researches [14,15] have identified level ratios and phase differences between the observations as appropriate features, as such features retain information on the magnitude and the argument of the TF points. Further discussion is presented in section 4.3.1.…”
Section: Feature Extractionmentioning
confidence: 99%
“…Whilst remaining similar in spirit to the DUET, the research was inclusive of non-ideal conditions such as room reverberation, and allowed more than two sensors in an arbitrary arrangement. This eventually culminated in the development of the multiple sensors degenerate unmixing estimation technique, termed MENUET [14,15]. Additionally, the mask estimation in MENUET was automated through the application of the k-means clustering technique.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The relative attenuation and delay histograms are used to determine the mixing parameters, before the source signals are estimated by TF binary masking. This approach is extended to more than 2 microphones in [11] [12].…”
Section: ( []mentioning
confidence: 99%
“…11.5). Therefore the impact of the noise due to the watermarking and the sparse assumption on the inversion process (see Section III-C) also remains limited 11 .…”
Section: A Datamentioning
confidence: 99%