2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2016
DOI: 10.1109/ipdps.2016.122
|View full text |Cite
|
Sign up to set email alerts
|

Subgraph Counting: Color Coding Beyond Trees

Abstract: The problem of counting occurrences of query graphs in a large data graph, known as subgraph counting, is fundamental to several domains such as genomics and social network analysis. Many important special cases (e.g. triangle counting) have received significant attention. Color coding is a very general and powerful algorithmic technique for subgraph counting. Color coding has been shown to be effective in several applications, but scalable implementations are only known for the special case of tree queries (i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(16 citation statements)
references
References 31 publications
(81 reference statements)
0
16
0
Order By: Relevance
“…A subsequent distributed scalable implementation of CC, SCALA [20], allowed the authors to count on graphs with 1-2M nodes the number of non-induced paths and trees. Another recent effort to scale CC is [7]: using a distributed algorithm, the authors estimate the occurrences of 10 different subgraphs of treewidth 2 and size up to k = 10 nodes, in graphs of up to 2M nodes. While these encouraging results make clear that CC is a promising approach, they leave wide open the important question of estimating the distribution of induced subgraphs, aka graphlets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…A subsequent distributed scalable implementation of CC, SCALA [20], allowed the authors to count on graphs with 1-2M nodes the number of non-induced paths and trees. Another recent effort to scale CC is [7]: using a distributed algorithm, the authors estimate the occurrences of 10 different subgraphs of treewidth 2 and size up to k = 10 nodes, in graphs of up to 2M nodes. While these encouraging results make clear that CC is a promising approach, they leave wide open the important question of estimating the distribution of induced subgraphs, aka graphlets.…”
Section: Related Workmentioning
confidence: 99%
“…Counting graphlets is a well-studied problem in graph mining and social-networks analysis [1,3,7,8,11,14,18,20,[27][28][29]32]. Given an input graph, the problem asks to count the frequencies of all induced connected subgraphs (called graphlets), up to isomorphism, of a certain size.…”
Section: Introductionmentioning
confidence: 99%
“…Given a k-node template T , it assigns random colors between 0 and k−1 to each vertex of a network graph G, and it counts the number of the occurrences of colorful embedding, which is isomorphic to T while having distinct colors on each vertex. Both theoretical proof [9], [14], [6] and experiments [3], [15] show that, with proper normalization, the count of colorful embeddings is an unbiased estimator of the actual count of embeddings. Alon et al [9] proved a guarantee of bounding the count by (1± )emb(T, G) with a probability of 1 − 2δ after running at most N iterations of the algorithm.…”
Section: B Statement Of Problemmentioning
confidence: 92%
“…[5] • Computing kernel of other algorithms: Sub-tree counting is one of the computing kernels of bounded treewidth subgraph (such as circles, cactus graphs, series-parallel graphs etc.) counting problem [6] and also the kernel of network clustering [7]. Despite subgraph counting plays an important role in discovery of patterns in a graph network, counting the exact number of subgraphs of size k in a n-vertex network takes O(n k ) time [4], which is computationally challenging even for moderate values of n and k. In fact, determining whether a graph G contains a subgraph to H is a related graph isomorphic problem that is NP-complete [8].…”
Section: Introductionmentioning
confidence: 99%
“…in a graph. Graphlet counting has a long and rich history, which began with triangle counting and received intense interest in recent years [2,6,7,10,12,15,17,20,21,25,26,27,30]. Since exact graphlet counting is notoriously hard, one must resort to approximate probabilistic counting to obtain algorithms with an acceptable practical performance.…”
Section: Introductionmentioning
confidence: 99%