2004
DOI: 10.1007/978-3-540-30116-5_51
|View full text |Cite
|
Sign up to set email alerts
|

A Tolerance Rough Set Approach to Clustering Web Search Results

Abstract: Extended AbstractTwo most popular approaches to facilitate searching for information on the web are represented by web search engine and web directories. Although the performance of search engines is improving every day, searching on the web can be a tedious and time-consuming task due to the huge size and highly dynamic nature of the web. Moreover, the user's "intention behind the search" is not clearly expressed which results in too general, short queries. Results returned by search engine can count from hun… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2005
2005
2017
2017

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 43 publications
(23 citation statements)
references
References 4 publications
0
23
0
Order By: Relevance
“…Another interesting algorithm that is inherently data-centric, but tries to create features comprehensible for users and brings us closer to the next section, is called tolerance rough set clustering (TRSC) [Ngo and Nguyen 2004]. In TRSC, the bag-of-words document representation is enriched with features possibly missing in a document and calculated using a tolerance rough set model computed over the space of all of the collection's terms.…”
Section: Cluster Construction and Labelingmentioning
confidence: 99%
“…Another interesting algorithm that is inherently data-centric, but tries to create features comprehensible for users and brings us closer to the next section, is called tolerance rough set clustering (TRSC) [Ngo and Nguyen 2004]. In TRSC, the bag-of-words document representation is enriched with features possibly missing in a document and calculated using a tolerance rough set model computed over the space of all of the collection's terms.…”
Section: Cluster Construction and Labelingmentioning
confidence: 99%
“…In this case, to answer that question, Ngo and Nguyen [16] proposed an approach to search for results clustering based on tolerance RS model following the work on document clustering. The application of tolerance RS model in document clustering was proposed as a way to enrich document and cluster representation to increase clustering performance.…”
Section: Related Workmentioning
confidence: 99%
“…Dimensionality Reduction or Feature subset selection is one of the important steps in data mining (Ahmed et al, 2009;Ngo and Nguyen, 2009;Selamat et al, 2010;Shylaja et al, 2010). Numerous features have been acquired and stored in databases due to the growth and development in real-time applications.…”
Section: Introductionmentioning
confidence: 99%
“…It is essential to reduce the dimensionality by selecting most relevant features which results in decreasing the measuring cost, transmission and storage cost and compact classification models. There are several techniques that have been proposed in the literature: Filter, wrapper and embedded (Selamat et al, 2010), unsupervised (Shylaja et al, 2010) and supervised (Ngo and Nguyen, 2009;Jensen and Shen, 2007) Rough set theory provides a mathematical tool that can be used for both feature selection and knowledge discovery (Jensen and Shen, 2007). It helps us to find out the minimal attribute sets called 'reducts' to classify objects without deterioration of classification quality and induce minimal length decision rules inherent in a given information system.…”
Section: Introductionmentioning
confidence: 99%