2006
DOI: 10.1016/j.datak.2006.01.010
|View full text |Cite
|
Sign up to set email alerts
|

Clustering e-commerce search engines based on their search interface pages using WISE-Cluster

Abstract: In this paper, we propose a new approach to clustering e-commerce search engines (ESEs) on the Web. Our approach utilizes the features available on the interface page of each ESE, including the label terms and value terms appearing in the search form, the number of images, normalized price terms as well as other terms. The experimental results based on more than 400 ESEs indicate that the proposed approach has good clustering accuracy. The importance of different types of features is analyzed and the terms in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
1

Year Published

2009
2009
2013
2013

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(19 citation statements)
references
References 19 publications
0
18
1
Order By: Relevance
“…Second, as mentioned in Section 1, the works of He et al (2004), Lu et al (2006) and Barbosa et al (2007), together with this paper, are on the same topic of SWS categorization. While the prior works show that it is feasible to determine the domain of a source by using discriminative features extracted from source's search interfaces, we take a step further.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Second, as mentioned in Section 1, the works of He et al (2004), Lu et al (2006) and Barbosa et al (2007), together with this paper, are on the same topic of SWS categorization. While the prior works show that it is feasible to determine the domain of a source by using discriminative features extracted from source's search interfaces, we take a step further.…”
Section: Related Workmentioning
confidence: 99%
“…Similar to other studies Lu et al, 2006;Barbosa et al, 2007), we assume that one web database, together with its search interfaces, belongs to one category. Since a document representing a web source belongs to one category, and there are multiple categories, our problem is equivalent to a single-label multiclass text categorization problem (Sebastiani, 2002).…”
Section: Classification Processmentioning
confidence: 99%
See 3 more Smart Citations