Proceedings of the Fifth ACM International Conference on Web Search and Data Mining 2012
DOI: 10.1145/2124295.2124324
|View full text |Cite
|
Sign up to set email alerts
|

Topical clustering of search results

Abstract: Search results clustering (SRC) is a challenging algorithmic problem that requires grouping together the results returned by one or more search engines in topically coherent clusters, and labeling the clusters with meaningful phrases describing the topics of the results included in them.In this paper we propose to solve SRC via an innovative approach that consists of modeling the problem as the labeled clustering of the nodes of a newly introduced graph of topics. The topics are Wikipedia-pages identified by m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
71
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 88 publications
(71 citation statements)
references
References 24 publications
0
71
0
Order By: Relevance
“…TAG MY SEARCH [5] is an example of Wikipedia-based topic discovery applied to a related task of general Web search result clustering. Unlike ScienScan, TAG MY SEARCH uses only articles but not categories of Wikipedia to represent topics, and thus performs flat rather then hierarchical grouping of the search results.…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…TAG MY SEARCH [5] is an example of Wikipedia-based topic discovery applied to a related task of general Web search result clustering. Unlike ScienScan, TAG MY SEARCH uses only articles but not categories of Wikipedia to represent topics, and thus performs flat rather then hierarchical grouping of the search results.…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…They use the K Nearest Neighbor (KNN) algorithm to find keyword clusters and then form document clusters by their similarity with each keyword cluster but they do not have statistical analysis on cluster labeling. Scaiella et al use a Wikipedia annotator TAGME to find the Wikipedia page titles associated with each document snippet [21]. In their keyword graph, a node is a Wikipedia page title (topic), the edge weights are the topic-to-topic similarities computed based on the Wikipedia linked-structure.…”
Section: A Review Of Atg Approachesmentioning
confidence: 99%
“…This algorithm automatically detects the number of communities and generates compact taxonomies. It has an advantage over existing commercial systems such as carrotsearch.com and Yippy, and also some most recent works since these methods partition the document collection to about 10 clusters which is not always the real number of topics [2] [21]. While many state-of-the-art search result clustering algorithms are flat, our method applies the Fast Modularity algorithm recursively in a top-down manner until certain conditions are reached.…”
Section: Phase Iii: Community Miningmentioning
confidence: 99%
“…However, it has been applied to clustering snippets resulting from Web search [19]. Each result snippet is annotated using TAGME 8 .…”
Section: Introductionmentioning
confidence: 99%