2005
DOI: 10.1016/j.datak.2004.11.004
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Web pages based on their structure

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
47
0
1

Year Published

2009
2009
2017
2017

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 61 publications
(48 citation statements)
references
References 24 publications
0
47
0
1
Order By: Relevance
“…All these sources provide basic qualitative data, and are often used for reference or annotation in more specialized domains. We use a simple random sample of input values for search forms of these sites in orde r based on httpUnit 9 w lues.…”
Section: Results and Evaluationmentioning
confidence: 99%
See 3 more Smart Citations
“…All these sources provide basic qualitative data, and are often used for reference or annotation in more specialized domains. We use a simple random sample of input values for search forms of these sites in orde r based on httpUnit 9 w lues.…”
Section: Results and Evaluationmentioning
confidence: 99%
“…2. We observe the concept of link-collection [9], which refers to anchor links in a page (class) that share the same path in the DOM tree, from the root element to their parent or grandparent element. As a result, these hyperlinks appear grouped together in the rendered page.…”
Section: Site-wide Wrapper Inductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Based on pattern vectors, similarity between pages is defined. To automatically extract main classes of pages offered by a website, [4] compares structures of DOM trees. In order to improve search results via text contents [1] uses the path length and [7] uses weighted path between two pages to adjust clusters.…”
Section: Related Workmentioning
confidence: 99%