A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications

Bertino, Elisa; Guerrini, Giovanna; Mesiti, Marco

doi:10.1016/s0306-4379(03)00031-0

Cited by 80 publications

(95 citation statements)

References 23 publications

“… D-factor underlines the semantic influence of node depth on XML semantic similarity. It follows the intuition that information placed near the root node of an XML document is more important than information further down in the hierarchy [6,90]. Thus, node labels higher in the XML tree hierarchy should have a greater semantic influence than their lower counterparts.…”

Section: Semantic Resemblance Between Sub-trees (Sem-rbs) (mentioning)

confidence: 75%
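
The depth intuition in the statement above can be illustrated with a weight that shrinks as a node sits deeper in the tree. The following is only a minimal sketch: the function name `depth_weights` and the weight `1 / (1 + depth)` are assumptions chosen for illustration, not the D-factor formula used by the citing paper.

```python
import xml.etree.ElementTree as ET

def depth_weights(xml_text):
    """Return (tag, depth, weight) for every element, with the root at depth 0.

    The weight 1 / (1 + depth) is a hypothetical choice: it only encodes
    the idea that labels near the root count more than deeper ones.
    """
    root = ET.fromstring(xml_text)
    result = []
    stack = [(root, 0)]
    while stack:
        node, depth = stack.pop()
        result.append((node.tag, depth, 1.0 / (1 + depth)))
        # Push children in reverse so they are visited in document order.
        for child in reversed(list(node)):
            stack.append((child, depth + 1))
    return result

doc = "<book><title/><chapter><section/></chapter></book>"
weights = depth_weights(doc)
for tag, depth, weight in weights:
    print(f"{tag}: depth={depth}, weight={weight:.2f}")
```

Any monotonically decreasing function of depth would express the same intuition; the reciprocal is used here only because it needs no tuning parameter.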

“…In general, element/attribute values are disregarded when evaluating the structural properties of heterogeneous XML documents (originating from different data-sources and not conforming to the same grammar), so as to perform XML structural classification/clustering [16,31,55,58] or structural querying (i.e., querying the structure of documents, disregarding content [6,64]). Nonetheless, values are usually taken into account with methods dedicated to XML change management [13,14], data integration [29,40], and XML structure-and-content querying applications [66,67], where documents tend to have similar structures (probably conforming to the same grammar [36,83]).…”

Section: Fig (mentioning)

confidence: 99%

“…Recent XML structure-based methods in [6,64] identify the need to support tag similarity (synonyms and stems) instead of tag syntactic equality while comparing XML documents. In [42], the authors introduce a structure and content based method for comparing XML documents having the same grammar (i.e., not heterogeneous), and consider semantic similarity evaluation between element/attribute values, using a variation of the edge-based methods.…”

Section: Integrating Structural and Semantic Similarity (mentioning)

confidence: 99%

A novel XML document structure comparison framework based-on sub-tree commonalities and label semantics

Tekli

Chbeir

2012

Journal of Web Semantics

“…In case of similarity of a document D and a schema S there are also two types of strategies -techniques which measure the number of elements which appear in D but not in S and vice versa (e.g. [1]) and techniques which measure the closest distance between D and "all" documents valid against S (e.g. [8]).…”

Section: Related Work (mentioning)

confidence: 99%
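
The first strategy described in the statement above, counting elements that appear in the document but not in the schema and vice versa, can be sketched on flat sets of element names. This is a simplified illustration under stated assumptions: the schema is given as a plain set of declared element names rather than a parsed DTD, hierarchy and cardinality are ignored, and the function name `element_overlap` and the ratio used as a score are hypothetical, not taken from any of the cited works.

```python
import xml.etree.ElementTree as ET

def element_overlap(xml_text, schema_elements):
    """Compare the element names used in a document with those a schema declares.

    Returns (extra, missing, score) where
      extra   = names used in the document but not declared in the schema,
      missing = names declared in the schema but absent from the document,
      score   = |common| / (|common| + |extra| + |missing|), an assumed ratio.
    """
    doc_elements = {el.tag for el in ET.fromstring(xml_text).iter()}
    declared = set(schema_elements)
    extra = doc_elements - declared
    missing = declared - doc_elements
    common = doc_elements & declared
    total = len(common) + len(extra) + len(missing)
    score = len(common) / total if total else 1.0
    return extra, missing, score

doc = "<book><title/><isbn/></book>"
schema = {"book", "title", "author"}  # hypothetical declared element names
extra, missing, score = element_overlap(doc, schema)
print(extra, missing, score)
```

A score of 1.0 means the two name sets coincide; it says nothing about whether the document is valid against the schema, since nesting and ordering are not examined.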

Equivalence of XSD Constructs and Its Exploitation in Similarity Evaluation

Mlýnková

2008

On the Move to Meaningful Internet Systems: OTM 2008

Abstract. In this paper we propose a technique for evaluating similarity of XML Schema fragments. Firstly, we define classes of structurally and semantically equivalent XSD constructs. Then we propose a similarity measure that is based on the idea of edit distance utilized to XSD constructs and enables one to involve various additional similarity aspects. In particular, we exploit the equivalence classes and semantic similarity of element/attribute names. Using preliminary experiments we show the behavior and advantages of the proposal.

“…Our embedding relation closely resembles the notion of simulation (for the formal definition, see [2]), which has been widely used in a number of works about querying, transformation, and verification of semistructured data (cf. [6,1,15,5]). It is important to have an efficient implementation of homeomorphic embedding because it is used repeatedly during the verification process as described in the following.…”

Section: Rule-based Web Site Verification (mentioning)

confidence: 99%

A Fast Algebraic Web Verification Service

Alpuente

Ballis

Falaschi

et al.

Web Reasoning and Rule Systems

Abstract. In this paper, we present the rewriting-based, Web verification service WebVerdi-M, which is able to recognize forbidden/incorrect patterns and incomplete/missing Web pages. WebVerdi-M relies on a powerful Web verification engine that is written in Maude, which automatically derives the error symptoms. Thanks to the AC pattern matching supported by Maude and its metalevel facilities, WebVerdi-M enjoys much better performance and usability than a previous implementation of the verification framework. By using the XML Benchmarking tool xmlgen, we develop some scalable experiments which demonstrate the usefulness of our approach.
