2011
DOI: 10.1007/s10115-011-0421-5
|View full text |Cite
|
Sign up to set email alerts
|

Using structural similarity for clustering XML documents

Abstract: In this paper, we describe a method for clustering XML documents. Its goal is to group documents sharing similar structures. Our approach is two-step. We first automatically extract the structure from each XML document to be classified. This extracted structure is then used as a representation model to classify the corresponding XML document. The idea behind the clustering is that if XML documents share similar structures, they are more likely to correspond to the structural part of the same query. Finally, fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 37 publications
0
5
0
Order By: Relevance
“…Here, clustering represents merging of similar types of XML data & applications of XML clustering are: information retrieval, data integration, document ranking, web mining as well as query processing. The major issues in XML data preprocessing for ranking are given below [2] :…”
Section: Proposed Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…Here, clustering represents merging of similar types of XML data & applications of XML clustering are: information retrieval, data integration, document ranking, web mining as well as query processing. The major issues in XML data preprocessing for ranking are given below [2] :…”
Section: Proposed Modelmentioning
confidence: 99%
“…The identification of a new TREC documents comes along with two vital tasks. The first task is the problem of identification of features in the TREC training data [2][3][4]. The second task is called the feature based document clustering and classification.…”
Section: Introductionmentioning
confidence: 99%
“…As X3D expresses the geometry and behaviour capabilities of VRML using XML [25] which has become an unchallenged standard for the representation and exchange of data on the web [26], X3D documents must also follow the rules of writing used in XML. Nevertheless, X3D documents can be written using a writing style as used in classic VRML encoding [27], so developers who are more familiar with the VRML writing style can choose this way.…”
Section: Web3d Standardsmentioning
confidence: 99%
“…An approach similar to S-GRACE was presented by Aïtelhadj et al (2012). The authors propose to transform XML documents into tree summaries by merging all repeating elements at each level of a document into a single node.…”
Section: Substructural Similarity Approachesmentioning
confidence: 99%