Proceedings of the 2007 ACM Symposium on Applied Computing 2007
DOI: 10.1145/1244002.1244196
|View full text |Cite
|
Sign up to set email alerts
|

On using user query sequence to detect off-topic search

Abstract: Retrieving off-topic documents to a user's pre-defined area of interest via a search engine is potentially a violation of access rights and is a concern to every private, commercial, and governmental organization. We improve content-based off-topic search detection approaches by using a sequence of user queries versus the individual queries. In this approach, we reevaluate how off-topic a query is, based on the sequence of queries that preceded it. Our empirical results show that using the information from the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2007
2007
2009
2009

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 4 publications
0
6
0
Order By: Relevance
“…Recall measures the rate of detection, while precision is defined as the ratio of the cases detected correctly as off-topic to the total of the true and false detections. Similar to [11], we use true negatives to calculate precision and recall, as the objective of our method is to decrease false positives and increase true negatives. The F1-measure is a harmonic mean of precision and recall.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Recall measures the rate of detection, while precision is defined as the ratio of the cases detected correctly as off-topic to the total of the true and false detections. Similar to [11], we use true negatives to calculate precision and recall, as the objective of our method is to decrease false positives and increase true negatives. The F1-measure is a harmonic mean of precision and recall.…”
Section: Discussionmentioning
confidence: 99%
“…User profiles are of interest to many areas of research, including personalization [15,16,18], peer-to-peer information retrieval [9], malicious data access detection [4], and off-topic search detection [2]. The user profile may be represented in various ways, including (but not limited to) bag of words derived from persumably valid user queries [5,6], bag of words that map to the legitimate user behavior or organizational tasks [7,11], or topics (categories) mapping to the legitimate user interests [13,14]. Similarly, in this work we assume that the user profiles consist of one or more topics based on user's interest.…”
Section: Prior Workmentioning
confidence: 99%
See 3 more Smart Citations