Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005
DOI: 10.1145/1076034.1076067
|View full text |Cite
|
Sign up to set email alerts
|

Using ODP metadata to personalize search

Abstract: The Open Directory Project is clearly one of the largest collaborative efforts to manually annotate web pages. This effort involves over 65,000 editors and resulted in metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is just about 0.05 percent of the Web pages indexed by Google, is this effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high quality personalized web search. First, we address this b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
140
0
4

Year Published

2005
2005
2019
2019

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 206 publications
(144 citation statements)
references
References 20 publications
0
140
0
4
Order By: Relevance
“…User profiles can be represented by a weighted term vector [7], weighted concept hierarchical structures [10] [12] like ODP 3 , or other implicit user interest hierarchy [11]. For the purposes of selectively exposing users' interests to search engines, the user profile is a term based hierarchical structure that is related to frequent term based clustering algorithms [16] [17].…”
Section: Related Workmentioning
confidence: 99%
“…User profiles can be represented by a weighted term vector [7], weighted concept hierarchical structures [10] [12] like ODP 3 , or other implicit user interest hierarchy [11]. For the purposes of selectively exposing users' interests to search engines, the user profile is a term based hierarchical structure that is related to frequent term based clustering algorithms [16] [17].…”
Section: Related Workmentioning
confidence: 99%
“…8, where each point represents the accuracy of the classifiers (trained with unweighted boolean vectors) when considering only their results for pages with a minimum np. We see that both the J48 and NaiveBayes are better at clas-sifying popular pages, and that this is not a bias in the data, since the random classifier does not share this property 3 . We suggest that there is a correlation between popularity and purely structural similarity, which may account for the fact that weighting the instances does not significantly improve the accuracy -since this is information that the classifiers learn anyway.…”
Section: A Weighted Stak Classifiermentioning
confidence: 93%
“…However, new tools are also needed to gather, harness, reuse and share, in the most efficient and enjoyable way, the experiences captured by UGC [4,15]. One particular line of research has focused on using recommendation technologies in an effort to make Web search more personal: by learning about the preferences and interests of individual searchers, personalized Web search systems can influence search results in a manner that better suits the individual searcher [3,18]. Recently, another complementary research direction has seen researchers explore the collaborative potential of Web search by proposing that the conventional solitary nature of Web search can be enhanced in many search scenarios by recognising and supporting the sharing of search experiences to facilitate synchronous or asynchronous collaboration among searchers [12,8].…”
Section: Introductionmentioning
confidence: 99%
“…Most re-ranking strategies attempt to construct a user profile from the user historical behaviors, and use the profile to filter out resources unmatched with his/her interests. [11] modeled both user profiles and resources as topic vectors from ODP hierarchy, thus the matching between user interest and content can be measured by their vector distance. A personalized PageRank algorithm was proposed in [12], which was a modification to the global PageRank on Web, and the search results were personalized based on the hyperlink structure.…”
Section: Personalized Searchmentioning
confidence: 99%