2020
DOI: 10.17743/jaes.2019.0043
|View full text |Cite
|
Sign up to set email alerts
|

Continuous Speech Emotion Recognition with Convolutional Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
24
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 53 publications
(25 citation statements)
references
References 0 publications
1
24
0
Order By: Relevance
“…Such analytics, in correlation with the delivered content, provide insight for future planning. The baseline metadata scheme can also be extended to involve speech [55,56] and music [57,58] emotional cues. As it is depicted in Figure 4, the functionality that concerns different groups of interest is unified in a common framework.…”
Section: 2a Web Application For Live Radio Production and Annotationmentioning
confidence: 99%
See 3 more Smart Citations
“…Such analytics, in correlation with the delivered content, provide insight for future planning. The baseline metadata scheme can also be extended to involve speech [55,56] and music [57,58] emotional cues. As it is depicted in Figure 4, the functionality that concerns different groups of interest is unified in a common framework.…”
Section: 2a Web Application For Live Radio Production and Annotationmentioning
confidence: 99%
“…To make the most of the available data and improve generalization, some common audio data augmentation techniques [56,60] have been applied:…”
Section: Speaker Recognition With Convolutional Neural Networkmentioning
confidence: 99%
See 2 more Smart Citations
“…In this context, new audio recognition and semantic analysis techniques are deployed for General Audio Detection and Classification (GADC) tasks, which are very useful in many multidisciplinary domains [4][5][6][7][8][9][10][11][12][13][14][15][16]. Typical examples include speech recognition and perceptual enhancement [5][6][7][8], speaker indexing and diarization [14][15][16][17][18][19], voice/music detection and discrimination [1][2][3][4][9][10][11][12][13][20][21][22], information retrieval and genre classification of music [23,24], audio-driven alignment of multiple recordings [25,26], sound emotion recognition [27][28][29] and others [10,[30][31][32]. Concerning the media production and broadcasting domain, audio and audio-driven segmentation allow for the implementation of prope...…”
Section: Introductionmentioning
confidence: 99%