Proceedings of the Second International Conference on Human Language Technology Research - 2002
DOI: 10.3115/1289189.1289268
|View full text |Cite
|
Sign up to set email alerts
|

Relevance models for topic detection and tracking

Abstract: We extend relevance modeling to the link detection task of Topic Detection and Tracking (TDT) and show that it substantially improves performance. Relevance modeling, a statistical language modeling technique related to query expansion, is used to enhance the topic model estimate associated with a news story, boosting the probability of words that are associated with the story even when they do not appear in the story. To apply relevance modeling to TDT, it had to be extended to work with stories rather than s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
54
0
2

Year Published

2003
2003
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 70 publications
(58 citation statements)
references
References 10 publications
(16 reference statements)
0
54
0
2
Order By: Relevance
“…In order to identify the effects of the number of index terms on the match performance, tests were repeated for 1, 2, 3, 4,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,125,150,175,200,225,250,275, 300, 400, 500 and 1000 terms respectively. In addition, logical operators AND, and OR were applied on the results obtained through the use of VSM and RM methods so that the effects of Boolean operators AND and OR on precision and recall measures can be seen.…”
Section: Testingmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to identify the effects of the number of index terms on the match performance, tests were repeated for 1, 2, 3, 4,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,125,150,175,200,225,250,275, 300, 400, 500 and 1000 terms respectively. In addition, logical operators AND, and OR were applied on the results obtained through the use of VSM and RM methods so that the effects of Boolean operators AND and OR on precision and recall measures can be seen.…”
Section: Testingmentioning
confidence: 99%
“…• Story Link Detection; to distinguish if the two different stories are on the same subject or not. In the TDT studies, the story link detection task is reported to have a critical role [5][6] [9]. Carrying out the story link detection task successfully is expected to solve many problems in TDT [10].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Because of that, the clarity score method has been widely used for query performance prediction in the area. Some applications include query expansion (anticipating poorly performing queries as good candidates to be expanded), rank fusion, link extraction in topic detection and tracking [15], and document segmentation [8]. A prolific sequel of variants and enhancements on the notion of clarity followed the original works [8,11].…”
Section: Introductionmentioning
confidence: 99%
“…Because of that, the clarity score method has been widely used for query performance prediction in the area. Some applications include query expansion (anticipating poorly performing queries as good candidates to be expanded), rank aggregation, link extraction in topic detection and tracking [16], and document segmentation [8]. A prolific sequel of variants and enhancements on the notion of clarity followed the original works [8,11].…”
Section: Introductionmentioning
confidence: 99%