2023
DOI: 10.14569/ijacsa.2023.0140623
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Features Audio Extraction for Speech Emotion Recognition Based on Deep Learning

Abstract: The increasing need for human interaction with computers makes the interaction process more advanced, one of which is by utilizing voice recognition. Developing a voice command system also needs to consider the user's emotional state because the users indirectly treat computers like humans in general. By knowing the type of a person's emotions, the computer can adjust the type of feedback that will be given so that the human-computer interaction (HCI) process will run more humanely. Based on the results of pre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…There are roughly 100 samples per category, so roughly 800 samples in total. The following features are extracted and trained on: Mel-frequency cepstral coefficients (MFCC), chromagram (Chroma), and Mel-frequency cepstrum (Mel), Contrast and German Tone Network (Tonnetz) [37] [38].…”
Section: Training Datamentioning
confidence: 99%
“…There are roughly 100 samples per category, so roughly 800 samples in total. The following features are extracted and trained on: Mel-frequency cepstral coefficients (MFCC), chromagram (Chroma), and Mel-frequency cepstrum (Mel), Contrast and German Tone Network (Tonnetz) [37] [38].…”
Section: Training Datamentioning
confidence: 99%