Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003)
DOI: 10.1109/wi.2003.1241281
|View full text |Cite
|
Sign up to set email alerts
|

Bidirectional hierarchical clustering for Web mining

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0
1

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 18 publications
0
7
0
1
Order By: Relevance
“…As similar terms will be clustered into the same node, outliers will eventually be singled out as individual terms in leaf nodes. Consequently, unlike some conventional methods such as K-means (Yao and Choi 2003), clustering using TTA is not susceptible to poor results due to outliers. In fact, there are two ways of looking at the term "Google", one as result in two clusters, three clusters, and five clusters, respectively.…”
Section: Evaluations and Discussionmentioning
confidence: 99%
“…As similar terms will be clustered into the same node, outliers will eventually be singled out as individual terms in leaf nodes. Consequently, unlike some conventional methods such as K-means (Yao and Choi 2003), clustering using TTA is not susceptible to poor results due to outliers. In fact, there are two ways of looking at the term "Google", one as result in two clusters, three clusters, and five clusters, respectively.…”
Section: Evaluations and Discussionmentioning
confidence: 99%
“…Some of the well‐known clustering methods include partitioning algorithms based on dividing entire data into dissimilar groups, hierarchical methods, density‐ and grid‐based clustering approaches, and graph‐based methods (Barthelemy and Leclerc 1995; Jain et al 1999; Grira, Crucianu, and Boujemaa 2005; Xu and Wunsch 2005). Due to the high dimensionality and lack of orthogonality of Web document vectors, algorithms such as K‐Means (McQueen 1967), hierarchical agglomerative clustering (HAC) (Day and Edelsbrunner 1985), and graph partitioning methods have gained more popularity and applicability in the Web environment (Yao and Choi 2003).…”
Section: Related Workmentioning
confidence: 99%
“…Oftentimes, some sort of hybrid algorithm is developed for a particular application. Real-world applications of hierarchical clustering can be found in the following references: speech recognition [14], web mining [15], lung cancer research [16], and document mining [17]. A wide variety of techniques are presented in these papers.…”
Section: Figurementioning
confidence: 99%