2016
DOI: 10.5815/ijigsp.2016.09.03
|View full text |Cite
|
Sign up to set email alerts
|

Speech Feature Extraction for Gender Recognition

Abstract: Speech Recognition Technology can be embedded in various real time applications in order to increase the human-computer interaction. From robotics to health care and aerospace, from interactive voice response systems to mobile telephony and telematics, speech recognition technology have enhanced the humanmachine interaction. Gender recognition is an important component for the application embedding speech recognition as it reduces the computational complexity for the further processing in these applications. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(26 citation statements)
references
References 1 publication
0
25
0
1
Order By: Relevance
“…x is the value of the ith sample and  is the preemphasis coefficient. The value of α used in this experiment is 0.9, which is the same value used in [3,19]. This procedure is used to spectrally flatten the speech signal causing the perseverance of high-frequency components of the samples.…”
Section: A Pre-processingmentioning
confidence: 99%
“…x is the value of the ith sample and  is the preemphasis coefficient. The value of α used in this experiment is 0.9, which is the same value used in [3,19]. This procedure is used to spectrally flatten the speech signal causing the perseverance of high-frequency components of the samples.…”
Section: A Pre-processingmentioning
confidence: 99%
“…The speech analyzer uses Feature Extraction process to remove the background noise from the signals and retain the words that have been spoken into the microphone as shown in Fig. 4 [14].…”
Section: Step 3: Removal Of the Background Noisementioning
confidence: 99%
“…GR can, for example, be achieved using low frequency data from the outline of a human face [4][5][6][7] ; kinematic data from gait analysis 3,8 ; skin texture 9 ; keystroke 10 ; voice; [11][12][13] and speech. [14][15][16][17][18] Recently some group of researchers worked on Parkinson's disease detection and GR using the deep network including pooling and feature extraction methods, to improve on the existing methods on GR. For instance, Tuncer and Dogan 19,20 and Yaman et al 21 worked on Parkinson's disease and gender detection.…”
Section: Introductionmentioning
confidence: 99%
“…Studies have shown that audio features such as the long-term structure of the audio spectrum, 35 Mel Frequency Cepstral Coefficient (MFCC), 36,37 its derivatives and transformations, 11,16,17,38,39 and Perceptual Linear Predictive, 36,37 are useful for gender classification and emotion recognition. Acoustic and prosodic features of speech 14,15,25 are other techniques that have been used for this classification and recognition.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation