2004
DOI: 10.1007/978-3-540-30222-3_47
|View full text |Cite
|
Sign up to set email alerts
|

Creating the DISEQuA Corpus: A Test Set for Multilingual Question Answering

Abstract: This paper describes the procedure adopted by the three co-ordinators of the CLEF 2003 question answering track (ITC-irst, UNED and ILLC) to create the question set for the monolingual tasks. Despite the little resources available, the three groups collaborated and managed to formulate and verify a large pool of original questions posed in three different languages: Dutch, Italian and Spanish. A part of these queries was translated into English and shared between the three coordination groups. Thus, a second c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2004
2004
2022
2022

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 1 publication
0
6
0
Order By: Relevance
“…Corpora developed for multilingual and crosslingual question-answering (QA), information retrieval (IR), and information extraction (IE) tasks are typically compilations of documents on related subjects written in different languages. Documents in such corpora rarely have counterparts in all the languages presented in the corpus (CLEF, 2000;Magnini et al, 2003).…”
Section: Related Workmentioning
confidence: 99%
“…Corpora developed for multilingual and crosslingual question-answering (QA), information retrieval (IR), and information extraction (IE) tasks are typically compilations of documents on related subjects written in different languages. Documents in such corpora rarely have counterparts in all the languages presented in the corpus (CLEF, 2000;Magnini et al, 2003).…”
Section: Related Workmentioning
confidence: 99%
“…The data set used in this work consists of the questions provided in the DISEQuA Corpus [10]. Such corpus was made up of simple, mostly short, straightforward and factual queries that sound naturally spontaneous, and arisen from a real desire to know something about a particular event or situation.…”
Section: Data Setsmentioning
confidence: 99%
“…The data set used in this work consists of the questions provided in the DISEQuA Corpus (Magnini et al, 2003). Such corpus was made up of simple, mostly short, straightforward and factual queries that sound naturally spontaneous, and arisen from a real desire to know something about a particular event or situation.…”
Section: Data Setsmentioning
confidence: 99%