A dataset for Movie Description

This paper is a preliminary study on finding events embedded in recent video datasets and transforming them into verbs. To this end, we review conventional video datasets for human action and activity, analyze the events embedded in them, and consider how those events can be transformed into verbs. As an early stage of this work, we investigate conventional and recently available visual datasets and analyze the activities or actions embedded in them.

“…Several datasets [3]- [10] for action recognition are designed and opened to the public but some of them are not suitable for realistic events.…”

Section: Video Datasets For Human Action and Activity

confidence: 99%

Investigation and Review of Embedded Events in Public Video Datasets

Kang, Kwon, Moon, et al. 2015

2015 8th International Conference on Signal Processing, Image Processing and Pattern Recognition (SIP)

“…Automatic Multimodal Content Analysis (AMCA), on the other hand, consists of computer-driven detection of visual and auditory elements from multimedia (Rohrbach & al 2015; Viitaniemi & al 2015). AMCA is cost-effective and produces consistent output, but is still insufficient for high-level semantic analysis.…”

Section: AD Vs AMCA

confidence: 99%

Towards Reliable Automatic Multimodal Content Analysis

Lautenbacher, Tiittula, Hirvonen, et al. 2015

Proceedings of the Fourth Workshop on Vision and Language

“…6 we present and discuss the results of the LSMDC 2015 and LSMDC 2016. This work is partially based on the original publications from Rohrbach et al (2015c, b) and the technical report from Torabi et al (2015). Torabi et al (2015) collected M-VAD, Rohrbach et al (2015c) collected the MPII-MD dataset and presented the translation-based description approach. Rohrbach et al (2015b) proposed the VisualLabels approach.…”

Section: Fig

confidence: 99%

“…(c) Focusing on more "visual" labels helps: we reduce the LSTM input dimensionality to 263 while improving the performance. (Rohrbach et al 2014), and showed the comparable performance to manually annotated SRs, see Rohrbach et al (2015c). In the following we use the best performing "Visual Labels" approach, Table 8, line (8).…”

Section: Robust Visual Classifiers

confidence: 99%

“…The Large Scale Movie Description Challenge (LSMDC) is based on two datasets which were originally collected independently. The MPII Movie Description Dataset (MPII-MD), initially presented by Rohrbach et al (2015c), was collected from Blu-ray movie data. It consists of AD and script data and uses sentence-level manual alignment of transcribed audio to the actions in the video (Sect.…”

Section: Datasets For Movie Description

confidence: 99%

Movie Description

et al. 2017

Audio description (AD) provides linguistic descriptions of movies and allows visually impaired people to follow a movie along with their peers. Such descriptions are by design mainly visual and thus naturally form an interesting data source for computer vision and computational linguistics. In this work we propose a novel dataset which contains transcribed ADs, which are temporally aligned to full length movies. In addition we also collected and aligned movie scripts used in prior work and compare the two sources of descriptions. We introduce the Large Scale Movie Description Challenge (LSMDC) which contains a parallel corpus of 128,118 sentences aligned to video clips from 200 movies (around 150 h of video in total).

A dataset for Movie Description

Cited by 356 publications

References 58 publications
