2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5496188
|View full text |Cite
|
Sign up to set email alerts
|

HMM-based separation of acoustic transfer function for single-channel sound source localization

Abstract: This paper presents a sound source (talker) localization method using only a single microphone, where a HMM (Hidden Markov Model) of clean speech is introduced to estimate the acoustic transfer function from a user's position. The new method is able to carry out this estimation without measuring impulse responses. The frame sequence of the acoustic transfer function is estimated by maximizing the likelihood of training data uttered from a given position, where the cepstral parameters are used to effectively re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…In our previous work, 12 using the estimated frame sequence data of the acoustic transfer function, the GMM for the acoustic transfer function was trained for each user's position. For test data, the talker position was estimated by finding a GMM having the maximum-likelihood from among the estimated GMMs corresponding to each position.…”
Section: Dimensional Feature Weighting and Classification Using mentioning
confidence: 99%
See 3 more Smart Citations
“…In our previous work, 12 using the estimated frame sequence data of the acoustic transfer function, the GMM for the acoustic transfer function was trained for each user's position. For test data, the talker position was estimated by finding a GMM having the maximum-likelihood from among the estimated GMMs corresponding to each position.…”
Section: Dimensional Feature Weighting and Classification Using mentioning
confidence: 99%
“…12 The estimation is implemented by maximizing the likelihood of the observed speech data from a user's position.…”
Section: B Maximum-likelihood-based Parameter Estimationmentioning
confidence: 99%
See 2 more Smart Citations
“…Takashima, et al [23] use an HMM-based approach to separate the acoustic transfer function so that they can separate the sources, using a single microphone. It is done by using an HMM model of the speech of each speaker to estimate the acoustic transfer function from each position in the room.…”
Section: Multiple Sources Of Speech and Far-field Audio Capturementioning
confidence: 99%