2021
DOI: 10.17762/turcomat.v12i5.2030
|View full text |Cite
|
Sign up to set email alerts
|

An Enhanced CNN-2D for Audio-Visual Emotion Recognition (AVER) Using ADAM Optimizer

Abstract: The importance of integrating visual components into the speech recognition process for improving robustness has been identified by recent developments in audio visual emotion recognition (AVER). Visual characteristics have a strong potential to boost the accuracy of current techniques for speech recognition and have become increasingly important when modelling speech recognizers. CNN is very good to work with images. An audio file can be converted into image file like a spectrogram with good frequency to extr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…Nowadays, the recognition of handwritten digits is a dynamic area of exploration within the field of handwriting recognition. Numerous systems for recognizing handwritten digits have been put forth recently to meet the requirements of practical applications, emphasizing the need for high accuracy and reliability in recognition [1]. Handwriting digit recognition is the method that utilizes a computer or machine learning model to recognize and interpret handwritten digits, typically in the range of 0 to 9.…”
Section: Introductionmentioning
confidence: 99%
“…Nowadays, the recognition of handwritten digits is a dynamic area of exploration within the field of handwriting recognition. Numerous systems for recognizing handwritten digits have been put forth recently to meet the requirements of practical applications, emphasizing the need for high accuracy and reliability in recognition [1]. Handwriting digit recognition is the method that utilizes a computer or machine learning model to recognize and interpret handwritten digits, typically in the range of 0 to 9.…”
Section: Introductionmentioning
confidence: 99%