2010
DOI: 10.1007/978-3-642-15751-6_46
|View full text |Cite
|
Sign up to set email alerts
|

Overview of VideoCLEF 2009: New Perspectives on Speech-Based Multimedia Content Enrichment

Abstract: VideoCLEF 2009 offered three tasks related to enriching video content for improved multimedia access in a multilingual environment. For each task, video data (Dutch-language television, predominantly documentaries) accompanied by speech recognition transcripts were provided. The Subject Classification Task involved automatic tagging of videos with subject theme labels. The best performance was achieved by approaching subject tagging as an information retrieval task and using both speech recognition transcripts… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 23 publications
(11 citation statements)
references
References 14 publications
0
11
0
Order By: Relevance
“…For the first challenge, two related tasks have been considered at the CLEF evaluation campaigns. VideoCLEF'09 [120] featured a task aimed at linking video content to related resources across languages. This task was framed as a known-item-task, where noisy ASR for Dutch was used to produce links to target English Wikipedia articles.…”
Section: Link Generationmentioning
confidence: 99%
See 1 more Smart Citation
“…For the first challenge, two related tasks have been considered at the CLEF evaluation campaigns. VideoCLEF'09 [120] featured a task aimed at linking video content to related resources across languages. This task was framed as a known-item-task, where noisy ASR for Dutch was used to produce links to target English Wikipedia articles.…”
Section: Link Generationmentioning
confidence: 99%
“…The first step of our approach to real-time entity linking generates link candidates and rank these candidates initially. The second step is aimed at improving precision using a learning to rerank approach, that was effective on similar tasks [120,141,149]. For link candidates many ranking criteria are in play, making learning to rerank particularly appropriate.…”
Section: Learning To Rerankmentioning
confidence: 99%
“…One of the major goals of automatic classification research is to reproduce the classification that would be generated by a human, given a specific classification scheme. For example, in [213], classes are drawn from the Media Topic Taxonomy of the International Press Telecommunications Council, 4 in [159] classes are drawn from the thesaurus used by archivists at the Netherlands Institute for Sound and Vision, a large audio-video archive, and in [161] classes are drawn from the set of tags assigned by users to video on blip.tv, a video sharing platform. Note that some forms of classification that are potentially useful for SCR perform their categorization of speech media based on characteristics not directly related to topic, such as genre in video [267] or dialogue acts in spoken conversation (i.e., statement, question, apology) [259].…”
Section: Extracting Topic Informationmentioning
confidence: 99%
“…Henk van Os is highly acclaimed and appreciated in the Netherlands, where he has established his ability to appeal to a broad audience. 4 Constraining the corpus to contain episodes from Beeldenstorm limits the spoken content to a single speaker speaking within the style of a single documentary series. This limitation is imposed in order to help control effects that could be introduced by variability in style or skill.…”
Section: Data Set and Task Definitionmentioning
confidence: 99%
“…Up to five different runs (i.e., system outputs created according to different experimental conditions) could be submitted. Further details about the data set and the Affect Detection task for VideoCLEF 2009 can be found in the track overview paper [4]. Participants were provided with additional resources accompanying the test data, including transcripts generated by an automatic speech recognition system [3].…”
Section: Data Set and Task Definitionmentioning
confidence: 99%