2018
DOI: 10.1007/s10772-018-09576-4
|View full text |Cite
|
Sign up to set email alerts
|

Improving the performance of the speaker emotion recognition based on low dimension prosody features vector

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…In [46], semi-NMF with k-means clustering initialization was used to transform feature sets, which were further combined with the original dataset to obtain a total of 72 features for SER obtaining 77.74% accuracy. In [47][48][49][50][51][52], different optimizing and feature selection techniques, namely enhanced kernel isometric mapping, the modified supervised locally linear embedding algorithm, sparse partial least squares regression, sequential floating forward selection, the scaled conjugate gradient, and principal component analysis, were used for improving the classification accuracy by reducing the feature set dimension. However, the classification accuracy obtained with the proposed SER system is higher than the other methods with 90% (approx.)…”
Section: Resultsmentioning
confidence: 99%
“…In [46], semi-NMF with k-means clustering initialization was used to transform feature sets, which were further combined with the original dataset to obtain a total of 72 features for SER obtaining 77.74% accuracy. In [47][48][49][50][51][52], different optimizing and feature selection techniques, namely enhanced kernel isometric mapping, the modified supervised locally linear embedding algorithm, sparse partial least squares regression, sequential floating forward selection, the scaled conjugate gradient, and principal component analysis, were used for improving the classification accuracy by reducing the feature set dimension. However, the classification accuracy obtained with the proposed SER system is higher than the other methods with 90% (approx.)…”
Section: Resultsmentioning
confidence: 99%
“…The DAN is a type of deep learning, which is advanced machine learning. Deep learning is widely employed in artificial intelligence domains such as medicine [18], images processing [19], information and sound application [20], computer vision [21], and many more applications [15], [22]- [28]. In this paper, the DAN is suggested, implemented and evaluated.…”
Section: Deep Autoencoder Networkmentioning
confidence: 99%
“…Facial landmark recognition algorithms are computer vision techniques designed to identify and locate specific facial landmarks or keypoints on a human face. These algorithms play a crucial role in various applications such as face analysis, facial expression recognition, face tracking, augmented reality, and drowsiness detection systems [23]. Finding the face in the image as well as identifying the points which create the face structure are the goals of Facial landmark.…”
Section: Facial Landmark Recognition Algorithmmentioning
confidence: 99%