2013
DOI: 10.1007/s10791-013-9225-4
|View full text |Cite
|
Sign up to set email alerts
|

A nonparametric term weighting method for information retrieval based on measuring the divergence from independence

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(11 citation statements)
references
References 29 publications
0
10
0
Order By: Relevance
“…In this subsection, we compare the empirical threshold of 70% with 10% by performing an extrinsic evaluation on the ClueWeb09 dataset using four different term-weighting models, namely BM25 [21], DFI [22], LGD [23] and DPH [24]. We also include a risk-sensitive evaluation similar to that of Gallagher et al [6], which takes into account not only the average effectiveness but also the per-query differences.…”
Section: Discussionmentioning
confidence: 99%
“…In this subsection, we compare the empirical threshold of 70% with 10% by performing an extrinsic evaluation on the ClueWeb09 dataset using four different term-weighting models, namely BM25 [21], DFI [22], LGD [23] and DPH [24]. We also include a risk-sensitive evaluation similar to that of Gallagher et al [6], which takes into account not only the average effectiveness but also the per-query differences.…”
Section: Discussionmentioning
confidence: 99%
“…The most correlated feature (AP and NDCG) is WMODEL.DFIZ_std which represents a weighting model based on the standardized distance from independence in term frequency [16]. WMODEL.ML2_std (second feature for AP) represents a weighting model based on multinomial randomness model, with Laplace after-e ect model and normalisation 2 [12]. WMODEL.In_expC2_max (second feature for NDCG) is the Inverse Expected Document Frequency weighting model with Bernoulli after-e ect and normalization [1].…”
Section: Individual E Ectivenessmentioning
confidence: 99%
“…vestigated the use of the Student's t-test for risk-sensitive evaluation, this is the first work to investigate the use of Pearson's Chi-square statistic for risk-sensitive evaluation, thereby facilitating the use of multiple baselines. Instead, previous usages of the Chi-square statistic has encompassed index term weighing [18] and document classification [22].…”
Section: Related Workmentioning
confidence: 99%
“…SIGIR '16, July [17][18][19][20][21]2016, Pisa, Italy. sure should consider per-query losses and gains compared to a particular baseline technique [11].…”
Section: Introductionmentioning
confidence: 99%