2016
DOI: 10.1038/srep28955
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of large networks with sub-sampling strategies

Abstract: Networks are routinely used to represent large data sets, making the comparison of networks a tantalizing research question in many areas. Techniques for such analysis vary from simply comparing network summary statistics to sophisticated but computationally expensive alignment-based approaches. Most existing methods either do not generalize well to different types of networks or do not provide a quantitative similarity score between networks. In contrast, alignment-free topology based network similarity score… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(21 citation statements)
references
References 33 publications
0
21
0
Order By: Relevance
“…where d min > 0 is a cutoff determining where the tail of the degree distribution starts. Most of empirical graphs are reported to haveγ Hill ∈ [2, 3], while graphex processes have almost-surely index γ = 1 + σ ∈ [1,2], at least if we rely on the definition of equation (4.3). So one might think graphex processes are unrealistic to model graphs withγ Hill ∈ [2,3].…”
Section: Relation To Hill Estimatormentioning
confidence: 99%
“…where d min > 0 is a cutoff determining where the tail of the degree distribution starts. Most of empirical graphs are reported to haveγ Hill ∈ [2, 3], while graphex processes have almost-surely index γ = 1 + σ ∈ [1,2], at least if we rely on the definition of equation (4.3). So one might think graphex processes are unrealistic to model graphs withγ Hill ∈ [2,3].…”
Section: Relation To Hill Estimatormentioning
confidence: 99%
“…The subsampling procedure of Bhattacharyya and Bickel (2015) focuses on assessment of uncertainty in motif, or subgraph counts in dense exchangeable networks and is not feasible for inference on degree distribution. The method of Ali et al (2016) also targets inference on counts of small sub-graphs and is based on the notion of dependency graphs under the network exchangeability framework. The vertex bootstrap approach of Snijders and Borgatti (1999), implemented in our R package snowboot, does not impose density limitations but assumes availability of the whole network upfront and is applicable to only small networks.…”
Section: Assumptionsmentioning
confidence: 99%
“…This opens the door for comparing very large networks which are outside the reach of current methods while still retaining state of the art performance. Furthermore, the NetEmd measures perform well under sub-sampling of nodes [Ali et al, 2016] (see Appendix D) which can be leveraged to further improve computational efficiency.…”
Section: Netemd Based On Different Sets Of Inputsmentioning
confidence: 99%