Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2001
DOI: 10.1145/383952.383968
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating a probabilistic model for cross-lingual information retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
120
2

Year Published

2003
2003
2012
2012

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 91 publications
(125 citation statements)
references
References 15 publications
3
120
2
Order By: Relevance
“…For example, the Arabic word "£ Y ¤¥ " can be translated as "bread" or "bake," and equation (1) would (with proper stemming) reward an occurrence of "baking bread." Corpus-based approaches to CLIR have generally developed within a framework based on language modeling rather than vector space models, at least in part because modern statistical translation frameworks offer a natural way of integrating translation and language models [18]. In general, language modeling approaches to retrieval rely on collection frequency (CF) in place of DF: 2 …”
Section: Introductionmentioning
confidence: 99%
“…For example, the Arabic word "£ Y ¤¥ " can be translated as "bread" or "bake," and equation (1) would (with proper stemming) reward an occurrence of "baking bread." Corpus-based approaches to CLIR have generally developed within a framework based on language modeling rather than vector space models, at least in part because modern statistical translation frameworks offer a natural way of integrating translation and language models [18]. In general, language modeling approaches to retrieval rely on collection frequency (CF) in place of DF: 2 …”
Section: Introductionmentioning
confidence: 99%
“…(Hiemstra and de Jong, 1999;Berger and Lafferty, 1999;Xu, et al, 2001;Federico and Bertoldi, 2002). Our crosslingual work is based on a widely-used extension of the above monolingual model (Xu et al, 2001). To generate a query from a document in a different language, say Arabic, one samples either the Arabic document, or the English background model.…”
Section: Language Modelsmentioning
confidence: 99%
“…The cross-language relevance model is given in Equation (12), which is the same as Equation (10), above, but now subscripts indicate that the documents D a and terms w a are in Arabic, and the query terms e i are English: (12) P(w a |D a ) can still be estimated as in Equation (11), with the document and background models in Arabic. P(e i |D a ) is estimated as in Berger and Lafferty (1999) and Xu, et al, (2001):…”
Section: Query Expansion and Relevance Modelingmentioning
confidence: 99%
See 2 more Smart Citations