Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '03 2003
DOI: 10.1145/956804.956822
|View full text |Cite
|
Sign up to set email alerts
|

A bag of paths model for measuring structural similarity in Web documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
26
0
3

Year Published

2005
2005
2012
2012

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(29 citation statements)
references
References 0 publications
0
26
0
3
Order By: Relevance
“…(a) Featurebased: it counts the number of common features in graphs, namely, domain-specific elementary structures, e.g., root-leaf paths [18]. (b) Structure-based: it assesses the similarity of the topology of graphs based on simulation [17,12], subgraph isomorphism (common maximum subgraph) [27,30], or edit distance [31] (see [9,26] for surveys).…”
Section: Related Workmentioning
confidence: 99%
“…(a) Featurebased: it counts the number of common features in graphs, namely, domain-specific elementary structures, e.g., root-leaf paths [18]. (b) Structure-based: it assesses the similarity of the topology of graphs based on simulation [17,12], subgraph isomorphism (common maximum subgraph) [27,30], or edit distance [31] (see [9,26] for surveys).…”
Section: Related Workmentioning
confidence: 99%
“…A different source used to determine similarity is the structural layout of pages [3,8,14,27]. The rationale of structure-based approaches is that pages containing similar information would also have a similar structure [27], or a similar layout and look-and-feel [14].…”
Section: Related Workmentioning
confidence: 99%
“…The rationale of structure-based approaches is that pages containing similar information would also have a similar structure [27], or a similar layout and look-and-feel [14]. An approach that has utilised tag frequency information from web pages to determine their similarity is reported in [3].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations