2021
DOI: 10.48550/arxiv.2101.11898
Preprint

HEMVIP: Human Evaluation of Multiple Videos in Parallel

Patrik Jonell,
Youngwoo Yoon,
Pieter Wolfert
et al.

Abstract: In many research areas, for example motion and gesture generation, objective measures alone do not provide an accurate impression of key stimulus traits such as perceived quality or appropriateness. The gold standard is instead to evaluate these aspects through user studies, especially subjective evaluations of video stimuli. Common evaluation paradigms either present individual stimuli to be scored on Likert-type scales, or ask users to compare and rate videos in a pairwise fashion. However, the time and reso…


Cited by 2 publications (5 citation statements)
References 12 publications (29 reference statements)

“…Although there are similarities, the two orderings are meaningfully different. This, together with the results in [25], reinforces a conclusion that the two studies managed to disentangle aspects of perceived motion quality (human-likeness) from the perceived link between gesture and speech (appropriateness). Figure 5, meanwhile, visualises confidence regions for the median rating as boxes whose horizontal and vertical extents are […]”
[Figure caption fused into the excerpt: “Box plots visualising the ratings distribution in the two studies.”]
Section: Analysis and Results of Subjective Evaluation (supporting)
confidence: 80%

“…These speech segments, which were not revealed to participants, were selected across the test inputs to be full and/or coherent phrases. The motion from the corresponding intervals in the BVH files submitted by participating teams was extracted and converted to a motion video clip using the visualisation server provided to participants (see Section 5.1), albeit at a higher resolution of 960×540 this time.…”
[Figure caption fragment fused into the excerpt: “…originates from [25], and was changed for each of the two evaluations in this paper.”]
Section: Stimuli (mentioning)
confidence: 99%

“…Each evaluation page presented the videos of four conditions for the same speech, and a participant rated each video on a scale of 0-100. This evaluation method [17] was inspired by MUSHRA [42], which is the standardized evaluation method for comparing audio qualities. We evaluated two different aspects of gestures as the GENEA Challenge did.…”
Section: Subjective Gesture Quality Evaluation (mentioning)
confidence: 99%
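
The excerpts above describe HEMVIP-style parallel rating: several conditions rated side by side on a 0-100 scale, then summarised per condition with medians and confidence regions. As a rough illustration only (not the authors' analysis code; the condition names and scores below are invented), such ratings could be aggregated into a per-condition median with a bootstrap percentile confidence interval along these lines:

```python
# Hypothetical sketch: aggregating 0-100 parallel ratings (HEMVIP/MUSHRA-style)
# into per-condition medians with bootstrap confidence intervals.
# Condition names and rating values are made up for illustration.
import numpy as np

rng = np.random.default_rng(0)

# ratings[condition] -> list of 0-100 scores, one per participant/page
ratings = {
    "natural_mocap": [78, 85, 90, 72, 88, 81, 69, 93],
    "baseline":      [45, 52, 38, 60, 41, 55, 49, 47],
    "proposed":      [63, 70, 58, 75, 66, 61, 72, 68],
}

def median_ci(scores, n_boot=10_000, alpha=0.05):
    """Bootstrap percentile confidence interval for the median rating."""
    scores = np.asarray(scores, dtype=float)
    boot = rng.choice(scores, size=(n_boot, scores.size), replace=True)
    boot_medians = np.median(boot, axis=1)
    lo, hi = np.quantile(boot_medians, [alpha / 2, 1 - alpha / 2])
    return np.median(scores), lo, hi

for condition, scores in ratings.items():
    med, lo, hi = median_ci(scores)
    print(f"{condition:>14}: median {med:5.1f}  (95% CI {lo:5.1f}-{hi:5.1f})")
```

Percentile bootstrap intervals are one simple way to obtain the kind of confidence region for the median rating mentioned in the first excerpt; the cited studies may well use a different interval construction.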