2017
DOI: 10.1016/j.eswa.2017.05.037
|View full text |Cite
|
Sign up to set email alerts
|

Age and gender classification from speech and face images by jointly fine-tuned deep neural networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
30
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 57 publications
(30 citation statements)
references
References 9 publications
0
30
0
Order By: Relevance
“…e ordinary frequency is converted to Mel frequency, and the first 12 steps of MFCC are extracted. e most widely used classifiers for speech recognition are HMM [25,26], GMM [27,28], ANN [29,30], and SVM [31,32]. In this paper, to improve the separability of data, the SVM classifier is used to generate a nonlinear mapping of the original features to a high-dimensional space; the choice of kernel function is Radial Basis Function (RBF).…”
Section: Mfcc Feature Extractionmentioning
confidence: 99%
“…e ordinary frequency is converted to Mel frequency, and the first 12 steps of MFCC are extracted. e most widely used classifiers for speech recognition are HMM [25,26], GMM [27,28], ANN [29,30], and SVM [31,32]. In this paper, to improve the separability of data, the SVM classifier is used to generate a nonlinear mapping of the original features to a high-dimensional space; the choice of kernel function is Radial Basis Function (RBF).…”
Section: Mfcc Feature Extractionmentioning
confidence: 99%
“…A resurgence of interest has been seen in last few years towards artificial neural networks, specifically deep learning has been used extensively after its spectacular success in the area of image classification, regression problems in time series data, and natural language processing [44][45][46]. This has been used in stock market analysis and prediction [47][48][49], breast cancer classification [50], classification of electrocardiogram signals [51], image classification [52], object recognition [53], medical image analysis [54], and time-series data analysis [55]. Deep learning, otherwise known as feature learning, has been well suited for deriving basic features from the input data and the obtained extracted features can be used to train a model in a better way.…”
Section: Related Workmentioning
confidence: 99%
“…They argue that the proposed method achieves higher success than the existing methods. [19] developed a new cost function from the well-known softmax and sigmoid functions for concurrent and jointly fine-tuned deep neural networks to classify age and gender from speech and face images. They proposed a new deep neural network architecture.…”
Section: Introductionmentioning
confidence: 99%