Proceedings of the 2014 International ACM Workshop on Crowdsourcing for Multimedia 2014
DOI: 10.1145/2660114.2660116
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-task Learning Framework for Time-continuous Emotion Estimation from Crowd Annotations

Abstract: We propose Multi-task learning (MTL) for time-continuous or dynamic emotion (valence and arousal) estimation in movie scenes. Since compiling annotated training data for dynamic emotion prediction is tedious, we employ crowdsourcing for the same. Even though the crowdworkers come from various demographics, we demonstrate that MTL can effectively discover (1) consistent patterns in their dynamic emotion perception, and (2) the low-level audio and video features that contribute to their valence, arousal (VA) eli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…Implementation details are given next. 1. Chapter 6 of [8] presented an aspect model with Bayesian active learning algorithm for regression problems and applied it to two movie rating applications, but it is single-task active learning instead of multi-task active learning.…”
Section: Single-task Gsymentioning
confidence: 99%
See 1 more Smart Citation
“…Implementation details are given next. 1. Chapter 6 of [8] presented an aspect model with Bayesian active learning algorithm for regression problems and applied it to two movie rating applications, but it is single-task active learning instead of multi-task active learning.…”
Section: Single-task Gsymentioning
confidence: 99%
“…Su et al [21] proposed MTL with low rank attribute embedding to perform person re-identification on multi-cameras, and demonstrated that it significantly outperformed existing single-task and multi-task approaches. Abadi et al [1] proposed MTL-based regression models to simultaneously learn the relationship between low-level audio-visual features and highlevel valence/arousal ratings from a collection of movie scenes. They can better predict valence and arousal ratings than scenespecific models.…”
Section: Introductionmentioning
confidence: 99%
“…This paper is an extension of the work presented in [2], where Multi-task learning (MTL) was employed to learn a crowd-based model for predicting dynamic arousal (A) and valence (V) levels in movie snippets. Given a set of related tasks, MTL simultaneously learns all tasks by modeling the similarities as well as differences among them to build task-specific classification or regression models.…”
Section: Introductionmentioning
confidence: 99%
“…In this work, we seek to improve the efficacy of crowdbased models by employing a small amount of expert data to guide the learning process-we employed 16 experts familiar with emotional attributes to provide dynamic A,V ratings for the clips annotated by crowdworkers 2 , and enhance crowd models employing expert knowledge via a novel expert-guided MTL (EG-MTL) algorithm. The EG-MTL algorithm seeks to simultaneously minimize the loss with respect to both crowd and expert labels in the optimization framework and learns a set of weights corresponding to each of the movie clips for which crowd annotations were sought.…”
Section: Introductionmentioning
confidence: 99%