Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007) 2007
DOI: 10.1109/ism.workshops.2007.47
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Speech Enhancement Techniques for Speaker Identification in Noisy Environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
18
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 6 publications
0
18
0
Order By: Relevance
“…Next the estimated dynamical features vector is feed to a Gaussian Mixture Model, GMM, which is used to obtain a representative model for each speaker. Taking into account that a SRS can be improved taking the voiced part of the speech signal because this contains the main information relative to the speaker identity (El-Solh, 2007;Markov & Nakagawa, 1999;Plumper et al, 1999). For this reason in this book the features vector will be derived from the LPC-cepstral extracted only from the voiced part speech signal.…”
Section: Speaker Recognition Systemmentioning
confidence: 99%
“…Next the estimated dynamical features vector is feed to a Gaussian Mixture Model, GMM, which is used to obtain a representative model for each speaker. Taking into account that a SRS can be improved taking the voiced part of the speech signal because this contains the main information relative to the speaker identity (El-Solh, 2007;Markov & Nakagawa, 1999;Plumper et al, 1999). For this reason in this book the features vector will be derived from the LPC-cepstral extracted only from the voiced part speech signal.…”
Section: Speaker Recognition Systemmentioning
confidence: 99%
“…The results of [2] suggest that optimizing the quality of speech perception or SNR, using speech quality enhancement techniques, does not optimize speech processing performance. Indeed, speech quality enhancement techniques may actually degrade the speech processing performance instead of improving it.…”
Section: Introductionmentioning
confidence: 97%
“…The performance of most speech processing systems is degraded severely when the training and the testing audio are captured in different acoustic environments (e.g., different levels of noise [1][2][3][4]). For example, training speech may be captured using a telephone handset in a noise-free environment, while the test speech is captured using a handsfree phone in a noisy environment.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations