2015
DOI: 10.1007/s11265-015-1005-5
|View full text |Cite
|
Sign up to set email alerts
|

Text-Independent Speaker Identification Using Vowel Formants

Abstract: Automatic speaker identification has become a challenging research problem due to its wide variety of applications. Neural networks and audio-visual identification systems can be very powerful, but they have limitations related to the number of speakers. The performance drops gradually as more and more users are registered with the system. This paper proposes a scalable algorithm for real-time text-independent speaker identification based on vowel recognition. Vowel formants are unique across different speaker… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(6 citation statements)
references
References 38 publications
0
6
0
Order By: Relevance
“…This study aims at hiding the global and inherent features of speakers, i.e., the vocal tract related spectral features (cf. Almaadeed et al (2016)) and some learned features, i.e., pitch and speaking rate. This translates to making changes in speech that relate to vocal tract length, average formant frequencies and intensities, pitch, and speaking rate.…”
Section: Proposed Pseudonymization Methodsmentioning
confidence: 99%
“…This study aims at hiding the global and inherent features of speakers, i.e., the vocal tract related spectral features (cf. Almaadeed et al (2016)) and some learned features, i.e., pitch and speaking rate. This translates to making changes in speech that relate to vocal tract length, average formant frequencies and intensities, pitch, and speaking rate.…”
Section: Proposed Pseudonymization Methodsmentioning
confidence: 99%
“…One method of identifying a speaker on the basis of only acoustic information is to use individual variations in voice [35] such as in Mel-frequency cepstral coefficients (MFCC) [36]- [38] and linear predictive coding [37], [39] along with vector quantization [36], [39] or a Gaussian mixture model [36]- [39]. However, reliable speaker identification requires a certain speaking duration.…”
Section: Related Work and Design Approachmentioning
confidence: 99%
“…For that, many features have been investigated in the literature [2], [11] where the cepstral features [6] are the most appropriate ones for speaker recognition tasks. Up today, the most popular and successful cepstral features are Mel frequency cepstral coefficients (MFCC) [3], [10], [22].…”
Section: Related Workmentioning
confidence: 99%