Interspeech 2014 2014
DOI: 10.21437/interspeech.2014-565
|View full text |Cite
|
Sign up to set email alerts
|

Integrating sequence information in the audio-visual detection of word prominence in a human-machine interaction scenario

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…The results indicate that Kernel ESZSL and SYNC-OVO / SYNC-OVO (rand) perform well on the GEMEP corpus, while EXEM (1NNS) and SYNC-OVO / SYNC-OVO (rand) achieve better performance on the DEMoS corpus (Figure 3). As an additional investigation, we present the UA and macro F1-score (calculated through averaging recalls and precision-recall integration) [71], [73] results of the strategies of Kernel ESZSL, EXEM (1NNS), SYNC-OVO, and SYNC-OVO (rand). The F1-score results obey the tendency of the UAs (Table VII), indicating that the analysis on the UAs in this work is possible to be transferred onto the joint precisionaccuracy measurement.…”
Section: B Experimental Results: Strategic Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…The results indicate that Kernel ESZSL and SYNC-OVO / SYNC-OVO (rand) perform well on the GEMEP corpus, while EXEM (1NNS) and SYNC-OVO / SYNC-OVO (rand) achieve better performance on the DEMoS corpus (Figure 3). As an additional investigation, we present the UA and macro F1-score (calculated through averaging recalls and precision-recall integration) [71], [73] results of the strategies of Kernel ESZSL, EXEM (1NNS), SYNC-OVO, and SYNC-OVO (rand). The F1-score results obey the tendency of the UAs (Table VII), indicating that the analysis on the UAs in this work is possible to be transferred onto the joint precisionaccuracy measurement.…”
Section: B Experimental Results: Strategic Comparisonmentioning
confidence: 99%
“…To select optimal parameters for each strategies, we utilise emotion-independent 5-fold and 3-fold Cross-Validation (CV) in grid-searching on the GEMEP and DEMoS corpora, respectively, which makes the validation set include samples from two emotions in each CV round. The measurement of Unweighted Accuracy (UA) is chosen in the CV rounds as the standard, calculated through averaging recalls [7], [16], [71].…”
Section: Experimental Setup For Learning Strategiesmentioning
confidence: 99%