A Novel S-LDA Features for Automatic Emotion Recognition from Speech using 1-D CNN

Tiwari, Pradeep; Darji, Anand D.

doi:10.33889/ijmems.2022.7.1.004

Cited by 3 publications

(3 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Additionally, when compared to other classifiers, the MLP classifier achieves superior results across all three datasets. Shifted Linear Discriminant Analysis (S-LDA) is proposed in paper [34] to derive dynamic attributes from static lowlevel variables like MFCC and Pitch. These adjusted features go into a 1D-CNN to extract high-level features for automatic event recognition (AER).…”

Section: Related Workmentioning

confidence: 99%

“…Furthermore, the spectral centroid yields a single scalar value at (34), indicating where most of the audio signal's energy lies in frequency; spectral contrast with seven scaler values at index (35 -41) represents the difference in amplitude between peaks and troughs in the audio spectrum. Then, the third spectral roll-off feature at index (42) with a single value marks the frequency below which a specific percentage (e.g., 85%) of the total spectral energy lies.…”

Section: B Speech Characteristicsmentioning

confidence: 99%

See 1 more Smart Citation

XEmoAccent: Embracing Diversity in Cross-Accent Emotion Recognition Using Deep Learning

Ahmad,

Iqbal,

Mohsin Jadoon

et al. 2024

IEEE Access

View full text Add to dashboard Cite

Speech is a powerful means of expressing thoughts, emotions, and perspectives. However, accurately determining the emotions conveyed through speech remains a challenging task. Existing manual methods for analyzing speech to recognize emotions are prone to errors, limiting our understanding and response to individuals' emotional states. To address diverse accents, an automated system capable of realtime emotion prediction from human speech is needed. This paper introduces a speech emotion recognition (SER) system that leverages supervised learning techniques to tackle cross-accent diversity. Distinctively, the system extracts a comprehensive set of nine speech features-Zero Crossing Rate, Mel Spectrum, Pitch, Root Mean Square values, Mel Frequency Cepstral Coefficients, chroma-stft, and three spectral features (Centroid, Contrast, and Roll-off) for refined speech signal processing and recognition. Seven machine learning models are employed, encompassing Random Forest, Logistic Regression, Decision Tree, Support Vector Machines, Gaussian Naive Bayes, K-Nearest Neighbors, ensemble learning, and four individual, hybrid deep learning models including Long short-term memory (LSTM) and 1-Dimensional Convolutional Neural Network (1D-CNN) with stratified cross-validation. Audio samples from diverse English regions are combined to train the models. The performance evaluation results of conventional machine learning and deep learning models indicate that the Random Forest-based feature selection model achieves the highest accuracy of up to 76% among the conventional machine learning models. Simultaneously, the 1D-CNN model with stratified cross-validation reaches up to 99% accuracy. The proposed framework enhances the cross-accent emotion recognition accuracy up to 86.3%, 89.87%, 90.27%, and 84.96% by margins of 14.71%, 10.15%, 9.6%, and 16.52% respectively.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: B Speech Characteristicsmentioning

confidence: 99%

XEmoAccent: Embracing Diversity in Cross-Accent Emotion Recognition Using Deep Learning

Ahmad,

Iqbal,

Mohsin Jadoon

et al. 2024

IEEE Access

View full text Add to dashboard Cite

show abstract

“…CNN has the ability of representation learning to pan unclassify the input information according to the hierarchical structure and it has been widely used in image classification (Bisht and Gupta, 2020), and speech recognition (Tiwari and Darji, 2022). 1D CNN, then, is an application of CNN model to the extraction of one-dimensional signals.…”

Section: Reliability Prediction Modelmentioning

confidence: 99%

Reliability Evaluation and Prediction Method with Small Samples

Dui

Dong

Tao

2023

Int. j. math. eng. manag. sci.

View full text Add to dashboard Cite

How to accurately evaluate and predict the degradation state of the components with small samples is a critical and practical problem. To address the problems of unknown degradation state of components, difficulty in obtaining relevant environmental data and small sample size in the field of reliability prediction, a reliability evaluation and prediction method based on Cox model and 1D CNN-BiLSTM model is proposed in this paper. Taking the historical fault data of six components of a typical load-haul-dump (LHD) machine as an example, a reliability evaluation method based on Cox model with small sample size is applied by comparing the reliability evaluation models such as logistic regression (LR) model, support vector machine (SVM) model and back propagation neural network (BPNN) model in a comprehensive manner. On this basis, a reliability prediction method based on one-dimensional convolutional neural network-bi-directional long and short-term memory network (1D CNN-BiLSTM) is applied with the objective of minimizing the prediction error. The applicability as well as the effectiveness of the proposed model is verified by comparing typical time series prediction models such as the autoregressive integrated moving average (ARIMA) model and multiple linear regression (MLR). The experimental results show that the proposed model is valuable for the development of reliability plans and for the implementation of reliability maintenance activities.

show abstract

Pertinent feature selection techniques for automatic emotion recognition in stressed speech

Tiwari

Darji

2022

Int J Speech Technol

View full text Add to dashboard Cite

A Novel S-LDA Features for Automatic Emotion Recognition from Speech using 1-D CNN

Cited by 3 publications

References 39 publications

XEmoAccent: Embracing Diversity in Cross-Accent Emotion Recognition Using Deep Learning

XEmoAccent: Embracing Diversity in Cross-Accent Emotion Recognition Using Deep Learning

Reliability Evaluation and Prediction Method with Small Samples

Pertinent feature selection techniques for automatic emotion recognition in stressed speech

Contact Info

Product

Resources

About