Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data 2012
DOI: 10.1145/2213836.2213855
|View full text |Cite
|
Sign up to set email alerts
|

Query preserving graph compression

Abstract: It is common to find graphs with millions of nodes and billions of edges in, e.g., social networks. Queries on such graphs are often prohibitively expensive. These motivate us to propose query preserving graph compression, to compress graphs relative to a class Q of queries of users' choice. We compute a small Gr from a graph G such that (a) for any query Q ∈ Q, Q(G) = Q ′ (Gr), where Q ′ ∈ Q can be efficiently computed from Q; and (b) any algorithm for computing Q(G) can be directly applied to evaluating Q ′ … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
114
0
1

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 170 publications
(115 citation statements)
references
References 44 publications
0
114
0
1
Order By: Relevance
“…The grouping of data vertices into hypernodes in our approach bears some similarity to structural summaries [10,8,2], graph summarization [11,14], and query-preserving graph compression [4]. Structural summaries are designed for path expressions, hence they group vertices sharing the same set of incoming label paths into a hypernode.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The grouping of data vertices into hypernodes in our approach bears some similarity to structural summaries [10,8,2], graph summarization [11,14], and query-preserving graph compression [4]. Structural summaries are designed for path expressions, hence they group vertices sharing the same set of incoming label paths into a hypernode.…”
Section: Related Workmentioning
confidence: 99%
“…These techniques group vertices into hypernodes based on a variety of statistics, such as node attributes values [16], degree distribution, or user-specified node attributes [14]. More closely related to our work is [4], which proposes a framework for query-preserving graph compression as well as two compression methods that preserve reachability queries and pattern matching queries (based on bounded simulation) respectively. Both methods are based on equivalence relations defined over the vertices of the original graph G, and compress G by merging vertices in the same equivalent class into a single node.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast to lossless compression schemes (e.g., [6,9,17]), query preserving compression is relative to Q, i.e., it generates small Dc that preserves the information only relevant to queries in Q rather than preserving the entire original D. Hence it often achieves a better compression ratio than lossless compression. Indeed, this approach has proven effective in answering graph queries on large social network graphs [16,31,32] and in cryptographic applications [24]. If the compression can be conducted in PTIME [16,31,32] and moreover, queries in Q can be answered in the compressed databases Dc in parallel polylog-time, perhaps by combining with other techniques such as indexing, then Q is Π-tractable, i.e., Q ∈ ΠT 0 Q .…”
Section: (4) Lowest Common Ancestors (Lca) Consider L3mentioning
confidence: 99%
“…Indeed, this approach has proven effective in answering graph queries on large social network graphs [16,31,32] and in cryptographic applications [24]. If the compression can be conducted in PTIME [16,31,32] and moreover, queries in Q can be answered in the compressed databases Dc in parallel polylog-time, perhaps by combining with other techniques such as indexing, then Q is Π-tractable, i.e., Q ∈ ΠT 0 Q . (6) Query answering using views.…”
Section: (4) Lowest Common Ancestors (Lca) Consider L3mentioning
confidence: 99%
See 1 more Smart Citation