2019
DOI: 10.1155/2019/7213717
|View full text |Cite
|
Sign up to set email alerts
|

DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network

Abstract: The speech entailed in human voice comprises essentially paralinguistic information used in many voice-recognition applications. Gender voice is considered one of the pivotal parts to be detected from a given voice, a task that involves certain complications. In order to distinguish gender from a voice signal, a set of techniques have been employed to determine relevant features to be utilized for building a model from a training set. This model is useful for determining the gender (i.e., male or female) from … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
31
0
2

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 56 publications
(33 citation statements)
references
References 17 publications
0
31
0
2
Order By: Relevance
“…To develop such a model for classification problems, one must choose the right feature from the human voice by which both gender and region of a person can be determined easily. There are most common features like MFCCs, Mel-scaled power spectrogram (Mel), power spectrogram chroma (Chroma), spectral contrast (Contrast), and tonal centroid features (Tonnetz) employed for gender recognition using 1D CNN model (Alkhawaldeh, 2019). Fundamental frequency, Energy, Spectral flatness, entropy, intensity, zero-crossing rate, etc.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…To develop such a model for classification problems, one must choose the right feature from the human voice by which both gender and region of a person can be determined easily. There are most common features like MFCCs, Mel-scaled power spectrogram (Mel), power spectrogram chroma (Chroma), spectral contrast (Contrast), and tonal centroid features (Tonnetz) employed for gender recognition using 1D CNN model (Alkhawaldeh, 2019). Fundamental frequency, Energy, Spectral flatness, entropy, intensity, zero-crossing rate, etc.…”
Section: Literature Reviewmentioning
confidence: 99%
“…To solve this noise problem needs a pre-processing process for feature extraction. Pre-emphasis, frame blocking, the hamming window are used in (Alkhawaldeh, 2019;Yusnita et al, 2017) as a pre-processing phase for removal of the noise.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Apparently, not only pitch-related features, but also Mel-Frequency Cepstrum Coefficients (MFCC) and energy-related features can play an important role for gender identification. A combination among the above-mentioned features also ensured particularly better results, as demonstrated in [21], [22]. Some other approaches were also tried to identify the gender of a speaker directly from raw audio signals like in [23] where a CNN was trained to directly extract features from the raw signal in a filtering stage and then the classification was performed.…”
Section: Related Workmentioning
confidence: 99%
“…They claimed that the model is very effective even if the amount of learning data is limited. Alkhawaldeh et al [23] described a gender classification model with the help of one dimension convolution neural network. They used the features, such as Mel Spectrogram, Mel Frequency Cepstral Coefficients, as the single dimension input sequence to CNN for training of the model.…”
Section: Literature Surveymentioning
confidence: 99%