Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1821
|View full text |Cite
|
Sign up to set email alerts
|

Ensemble-Within-Ensemble Classification for Escalation Prediction from Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…Log-Mel spectrograms are widely used in deep learning for various CV tasks such as speech escalation detection [ 114 ], audio classification [ 115 ], and ASR [ 116 ]. In the current work, we also use log-Mel spectrograms, and as deep learning models we implement three 2DCNN models: ResNet [ 51 ], VGG [ 117 ], and PANN [ 118 ].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Log-Mel spectrograms are widely used in deep learning for various CV tasks such as speech escalation detection [ 114 ], audio classification [ 115 ], and ASR [ 116 ]. In the current work, we also use log-Mel spectrograms, and as deep learning models we implement three 2DCNN models: ResNet [ 51 ], VGG [ 117 ], and PANN [ 118 ].…”
Section: Methodsmentioning
confidence: 99%
“…In the current work, we also use log-Mel spectrograms, and as deep learning models we implement three 2DCNN models: ResNet [ 51 ], VGG [ 117 ], and PANN [ 118 ]. The selected models have been repeatedly used in CV tasks for audio modality processing [ 114 , 116 ]. Model architectures are shown in Figure 4 .…”
Section: Methodsmentioning
confidence: 99%