2004 IEEE International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.2004.1326036
|View full text |Cite
|
Sign up to set email alerts
|

Application of the modified group delay function to speaker identification and discrimination

Abstract: In this paper, we explore new methods by which speakers can be identified and discriminated, using features derived from the fourier transform phase. The Modified Group Delay Feature(MODGDF) which is a parameterized form of the modified group delay function is used as a front end feature in this study. A Gaussian mixture model(GMM) based speaker identification system is built with the MODGDF as the front end feature. The system is tested on both clean (TIMIT) and noisy telephone(NTIMIT) speech. The results obt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 49 publications
(23 citation statements)
references
References 9 publications
0
23
0
Order By: Relevance
“…Usually only the magnitude spectrum is retained, based on the belief that phase has little perceptual importance. However, [179] provides opposing evidence while [96] described a technique which utilizes phase information.…”
Section: Short-term Spectral Featuresmentioning
confidence: 99%
“…Usually only the magnitude spectrum is retained, based on the belief that phase has little perceptual importance. However, [179] provides opposing evidence while [96] described a technique which utilizes phase information.…”
Section: Short-term Spectral Featuresmentioning
confidence: 99%
“…Due to the fact that these features are relatively unstable, and it is difficult to separate the phase information accurately, there has been little progress until recently. Robust ASR and Speaker Identification have been recent advances, where a Modified Group Delay Function (MODGDF) is used [17,18]. MODGDF has been tested in Language Identification but did not show a significant or stable performance improvement [17].…”
Section: Phase-based Featuresmentioning
confidence: 99%
“…It has been used in speaker identification (Hegde et al [84]), but also in speech analysis, speech segmentation, speech recognition and language identification frameworks (Murthy and Yegnanarayana [85]). …”
Section: Stft-based Frequency Featuresmentioning
confidence: 99%