2004
DOI: 10.1007/978-3-540-24752-4_3
|View full text |Cite
|
Sign up to set email alerts
|

A User-Centered Approach to Evaluating Topic Models

Abstract: Abstract. This paper evaluates the automatic creation of personal topic models using two language model-based clustering techniques. The results of these methods are compared with user-defined topic classes of web pages from personal web browsing histories from a 5-week period. The histories and topics were gathered during a naturalistic case study of the online information search and use behavior of two users. This paper further investigates the effectiveness of using display time and retention behaviors as i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 14 publications
0
8
0
Order By: Relevance
“…They proposed a generative probabilistic mixture model for comparative text mining. Kelly et al [50] investigated the techniques used for topic clustering of documents and discovered that most methods performed poorly when evaluated according to the user-defined topic classes. Mei and Zhai [51] used statistical language models to perform a special temporal text mining task, which is studying the patterns of themes in text.…”
Section: Text Classification and Categorization Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…They proposed a generative probabilistic mixture model for comparative text mining. Kelly et al [50] investigated the techniques used for topic clustering of documents and discovered that most methods performed poorly when evaluated according to the user-defined topic classes. Mei and Zhai [51] used statistical language models to perform a special temporal text mining task, which is studying the patterns of themes in text.…”
Section: Text Classification and Categorization Modelsmentioning
confidence: 99%
“…TDT consists of five tasks, topic 742 H. C. OZMUTLU tracking, link detection, topic detection, first story detection and story segmentation [56]. Topic modeling has been performed significantly in the context of topic detection and tracking [50]. The significant works on TDT can be listed as those of Feng and Allan [56], Larkey et al [57], and Kumaran and Allan [58,59].…”
Section: Text Classification and Categorization Modelsmentioning
confidence: 99%
“…One problem with all of these approaches is that they are more system-or query-centered, than user-centered, and have not leveraged context. Although some systems have attempted to represent system-defined context through automatic clustering, such clusters do not necessarily reflect how users would cluster their documents [17,22].…”
Section: Introductionmentioning
confidence: 99%
“…In addition, retention (or saving) has been found to be potentially useful as implicit evidence for identifying documents that could be used as seeds for re-weighting (Kelly, Diaz, Belkin, & Allan, 2004). This is quite intuitive.…”
Section: Rlb Personalization Using Search Behaviorsmentioning
confidence: 99%