2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops 2010
DOI: 10.1109/cvprw.2010.5543575

A computer-vision-assisted system for Videodescription scripting

Cited by 18 publications (12 citation statements)
References 11 publications
“…Despite the potential benefit of DVS for computer vision, it has not been used so far apart from [25,42] who study how to automate DVS production. We believe the main reason for this is that it is not available in the text format, i.e.…”
Section: The Movie Description Dataset (mentioning)
confidence: 99%
“…3PlayMedia's post-production audio description tool [5] and Gagnon et al's audio description tool [18], both let authors provide text descriptions on videos, synthesizing text-to-speech for playback of the descriptions. Gagnon et al's tool also provides authors with timeline-based visualisations tailored to the production of cinematic audio descriptions including recognition of scenes, characters, and important locations [18]. While prior work suggests that people prefer human-narrated audio descriptions to speech-to-text audio descriptions when available [15,28], the aforementioned systems do not support narrated audio descriptions.…”
Section: Describing Videos (mentioning)
confidence: 99%
“…This process, however, could be facilitated by the use of semi-automated visual recognition techniques, which have been developed in different contexts (such as surveillance and video database indexing). An early example is VDManager [7], a VD editing software tool, which uses speech recognition as well as key-places and key-faces visual recognition.…”
Section: Conclusion and New Frontiers (mentioning)
confidence: 99%