Proceedings of the 18th ACM Conference on Information and Knowledge Management 2009
DOI: 10.1145/1645953.1646117
|View full text |Cite
|
Sign up to set email alerts
|

Retrieval experiments using pseudo-desktop collections

Abstract: Desktop search is an important part of personal information management (PIM). However, research in this area has been limited by the lack of shareable test collections, making cumulative progress difficult. In this paper, we define desktop search as a semi-structured document retrieval problem and introduce a methodology to automatically build a reusable collection (the pseudo-desktop) that has many of the same properties as a real desktop collection.We then present a comprehensive evaluation of retrieval meth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
56
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 34 publications
(56 citation statements)
references
References 21 publications
0
56
0
Order By: Relevance
“…This use-case is representative of situations where the indexing documents have a rich content (tens to hundreds of thousands of terms) and documents updates and deletes can be performed randomly. To capture the behavior of our solution in such context, we use the pseudo-desktop data collection and query set provided in [11] which is considered as representative of a personal desktop where searches, updates and deletes are performed.…”
Section: Experimental Evaluation 71 Experimental Setupmentioning
confidence: 99%
See 2 more Smart Citations
“…This use-case is representative of situations where the indexing documents have a rich content (tens to hundreds of thousands of terms) and documents updates and deletes can be performed randomly. To capture the behavior of our solution in such context, we use the pseudo-desktop data collection and query set provided in [11] which is considered as representative of a personal desktop where searches, updates and deletes are performed.…”
Section: Experimental Evaluation 71 Experimental Setupmentioning
confidence: 99%
“…The desktop search is an important topic in the IR community, but real personal collections of desktop files cannot be published for evident privacy issues. Instead, the authors in [11] propose a method to generate pseudo desktop collections and show that such collections have the same properties as real collections. As recommended in [11], we preprocess the files in this collection by removing the stop words and stemming the remaining terms using the Krovetz stemmer.…”
Section: Experimental Evaluation 71 Experimental Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…Another related research area involves the extraction of representative words from a document. Research on knownitem search [4,14] constructs (query, document) pairs by extracting important words from a document and formulating pseudo queries in a desktop search environment. This work, however, chooses words from a document based on some basic statistical indicators, which risks losing informative phrases and could be too simple to generate queries for long documents of 5000 words.…”
Section: Related Workmentioning
confidence: 99%
“…This latter issue creates problems for all aspects of the evaluation of search of personal collections. Current work on evaluation of personal collection search is exploring the development of simulated personal Cranfield type search test collections [2]. However, these collections do not represent the diversity of real users collections, items selected by an individual owning the collection that they actually want to retrieve from it, nor the query terms collection owners will use.…”
Section: Introductionmentioning
confidence: 99%