2017 Proceedings of the Ninteenth Workshop on Algorithm Engineering and Experiments (ALENEX) 2017
DOI: 10.1137/1.9781611974768.5
|View full text |Cite
|
Sign up to set email alerts
|

I/O-efficient Generation of Massive Graphs Following the LFR Benchmark

Abstract: LFR is a popular benchmark graph generator used to evaluate community detection algorithms. We present EM-LFR, the first external memory algorithm able to generate massive complex networks following the LFR benchmark. Its most expensive component is the generation of random graphs with prescribed degree sequences which can be divided into two steps: the graphs are first materialized deterministically using the Havel-Hakimi algorithm, and then randomized. Our main contributions are EM-HH and EM-ES, two I/O-effi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
21
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2
1

Relationship

3
3

Authors

Journals

citations
Cited by 8 publications
(21 citation statements)
references
References 32 publications
0
21
0
Order By: Relevance
“…The convergence process is inherently slow and so the model has clear scalability limitations that are known to both academics and practitioners. The fastest published variant of the model that is able to generate large graphs is the external memory algorithm proposed by Hamann et al (2018).…”
Section: Problem With Scalabilitymentioning
confidence: 99%
See 1 more Smart Citation
“…The convergence process is inherently slow and so the model has clear scalability limitations that are known to both academics and practitioners. The fastest published variant of the model that is able to generate large graphs is the external memory algorithm proposed by Hamann et al (2018).…”
Section: Problem With Scalabilitymentioning
confidence: 99%
“…In this paper, we concentrate on single-threaded ABCD and LFR implementations in order to focus on the theoretical concepts behind ABCD. However, as an outlook for further work, it is possible to design a distributed out-of-core implementation of ABCD to generate huge graphs having billions of vertices, similarly like it is done in Hamann et al (2018) for LFR. Indeed, for example, generation of edges within communities can be performed using perfectly parallel approach, as each community is processed independently (see Section 3 for details).…”
Section: Problem With Scalabilitymentioning
confidence: 99%
“…* This work was partially supported by the DFG under grants ME 2088/3-2, WA 654/22-2. Parts of this paper were published as [21].…”
mentioning
confidence: 99%
“…Chapters 4 and 5, based on [82,167,168], are concerned with I/O-e cient Markov chain processes for the perturbation of simple graphs. In Chapter 4, we develop EM LFR, an I/O-e cient sampling pipeline for the LFR community detection benchmark, and engineer a parallel implementation able to produce graph instances orders of magnitude larger than the available main memory.…”
Section: Graphs From Prescribed Degree Sequencementioning
confidence: 99%
“…Hamann et al [166,167] propose an external memory algorithm that uses I/O-e cient ES and a streaming implementation of H H to generate graphs. Rewiring steps are also implemented based on ES.…”
Section: Em Lfr: Lfr In Emmentioning
confidence: 99%