2009
DOI: 10.1007/s11042-009-0400-y

Thesaurus enrichment for query expansion in audiovisual archives

Abstract: It is common practice in audiovisual archives to disclose documents using metadata from a structured vocabulary or thesaurus. Many of these thesauri have limited or no structure. The objective of this paper is to find out whether retrieval of audiovisual resources from a collection indexed with an in-house thesaurus can be improved by enriching the thesaurus structure. We propose a method to add structure to a thesaurus by anchoring it to an external, semantically richer thesaurus. We investigate the added val…

Cited by 5 publications (4 citation statements)
References 15 publications
“…We say that this is undoubtedly one of the major reasons for a system's poor performance, especially as the outcome of subject searching [3,4]. Agreeing with [5] we also suggest that the Subject facet is the only facet with semantic relations between its terms, making it the most suitable facet for our method and experimental setup.…”
Section: Related Work (mentioning)
confidence: 80%
“…In [18], the enrichment consists of adding terms to a source thesaurus by exploiting parallel corpora. In [30], WordNet is used to establish hierarchical relations between the source terms. Oliveira and Gomes [47] propose a method to automatically enrich a Portuguese thesaurus with synonyms extracted from dictionaries.…”
Section: Terminology-related Technology (mentioning)
confidence: 99%
“…This module allows to obtain a list of the most representative terms from a given corpus. After analysing and testing several open source automatic term extraction (ATE) tools, and also proprietary software, as mentioned in Section 2, we chose to implement the TBXTools service 30 [48]. TBXTools is a fast and flexible tool that offers statistical and linguistic approaches to term extraction.…”
Section: Module 1: Terminology Extraction (mentioning)
confidence: 99%
“…Hierarchical mesh segmentation method [4] which summarizes the conventional way of achieving semantic annotation is adopted by a system that collects personnel and company information via natural language processing, as well as many other applications that focus on ontological annotation of various kinds of documents. Hollink [12] annotates paintings based on features such as artist name, painting style, etc. However, because this system annotates directly to the image document, it lacks the ability to query semantic features from the paintings.…”
Section: Annotation Based On Semantic Relationship (mentioning)
confidence: 99%