2016 IEEE 16th International Conference on Data Mining (ICDM) 2016
DOI: 10.1109/icdm.2016.0029
|View full text |Cite
|
Sign up to set email alerts
|

Waddling Random Walk: Fast and Accurate Mining of Motif Statistics in Large Graphs

Abstract: Algorithms for mining very large graphs, such as those representing online social networks, to discover the relative frequency of small subgraphs within them are of high interest to sociologists, computer scientists and marketeers alike. However, the computation of these network motif statistics via naive enumeration is infeasible for either its prohibitive computational costs or access restrictions on the full graph data. Methods to estimate the motif statistics based on random walks by sampling only a small … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
53
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 50 publications
(54 citation statements)
references
References 37 publications
0
53
0
Order By: Relevance
“…The variance of sampling algorithms can be decomposed into two parts, an independent sample variance component and a between sample covariance component. As we have seen the independent variance component is based on the properties of π resulting from the procedure (see (11)). The covariance component will be low if the samples are not highly dependent on the past, namely that the Markov chain is well mixed.…”
Section: Theoretical Analysis Of Liftingmentioning
confidence: 99%
“…The variance of sampling algorithms can be decomposed into two parts, an independent sample variance component and a between sample covariance component. As we have seen the independent variance component is based on the properties of π resulting from the procedure (see (11)). The covariance component will be low if the samples are not highly dependent on the past, namely that the Markov chain is well mixed.…”
Section: Theoretical Analysis Of Liftingmentioning
confidence: 99%
“…Approximate counting of 3-, 4-, 5-vertex graphlets in static graphs has received much more attention than exact counting, which has an exponential cost. Most of the literature on approximate counting of graphlets uses randomwalks to collect a uniform sample of graphlets on static graphs [3,8,16,37]. Alternatively, Bressan et al [6] proposed a color coding based scheme for estimating k-vertex graphlet statistics.…”
Section: Related Workmentioning
confidence: 99%
“…Relation to Existing Work: MMGAN uses multiple techniques for learning on graphs and combines them into a motif-aware model. Random walks on graphs are widely used to learn the local and global topology of a graph [17][18][19], while biased random walks are used to characterize higher-order network structures like hyperedges and network motifs [8,[20][21][22][23][24]. Generative Adversarial Networks (GAN) are highly effective at learning implicit features of a data set and using these to generate realistic data samples.…”
Section: Introductionmentioning
confidence: 99%