2014
DOI: 10.1007/s00224-014-9544-x
|View full text |Cite
|
Sign up to set email alerts
|

XML Compression via Directed Acyclic Graphs

Abstract: Unranked node-labeled trees can be represented using their minimal dag (directed acyclic graph). For XML this achieves high compression ratios due to their repetitive mark up. Unranked trees are often represented through first child/next sibling (fcns) encoded binary trees. We study the difference in size (= number of edges) of minimal dag versus minimal dag of the fcns encoded binary tree. One main finding is that the size of the dag of the binary tree can never be smaller than the square root of the size of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
27
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
4
1

Relationship

4
5

Authors

Journals

citations
Cited by 27 publications
(27 citation statements)
references
References 29 publications
0
27
0
Order By: Relevance
“…Whereas dags only allow to share repeated subtrees, TSLPs can also share repeated internal tree patterns. Several grammar-based tree compressors were developed in [2,8,11,28,37], where the work from [2] is based on another type of tree grammars (elementary ordered tree grammars). The algorithm from [28] achieves an approximation ratio of O(log n) (for a constant set of node labels).…”
Section: Introductionmentioning
confidence: 99%
“…Whereas dags only allow to share repeated subtrees, TSLPs can also share repeated internal tree patterns. Several grammar-based tree compressors were developed in [2,8,11,28,37], where the work from [2] is based on another type of tree grammars (elementary ordered tree grammars). The algorithm from [28] achieves an approximation ratio of O(log n) (for a constant set of node labels).…”
Section: Introductionmentioning
confidence: 99%
“…It should be noted that (4) holds only if Φ is invariant under permutation of arguments. In addition,…”
Section: A Proof Of Propositionmentioning
confidence: 99%
“…Motivated by applications where large tree structured data occur, like XML processing, grammar-based compression has been extended to trees [9,10,26,33], see [31] for a survey. Unless otherwise specified, a tree in this paper is always a rooted ordered tree over a ranked alphabet, i.e., every node is labelled with a symbol and the rank of this symbol is equal to the number of children of the node.…”
Section: Introductionmentioning
confidence: 99%