2018
DOI: 10.1109/tpds.2017.2743708
|View full text |Cite
|
Sign up to set email alerts
|

GraphD: Distributed Vertex-Centric Graph Processing Beyond the Memory Limit

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 37 publications
(21 citation statements)
references
References 28 publications
0
19
0
Order By: Relevance
“…If a graph processing system uses an external storage, there is another I/O cost to read the partitioned graphs from the external storage. For example, systems like HybridGraph, GraphD [41] and Pregelix store the partitioned subgraphs in local disks of workers, and load the subgraphs in every iteration to avoid scalability issues such as out-ofmemory error. Reading the subgraphs from external storage causes massive disk I/Os.…”
Section: Costs Of Distributed Graph Processing Systemsmentioning
confidence: 99%
“…If a graph processing system uses an external storage, there is another I/O cost to read the partitioned graphs from the external storage. For example, systems like HybridGraph, GraphD [41] and Pregelix store the partitioned subgraphs in local disks of workers, and load the subgraphs in every iteration to avoid scalability issues such as out-ofmemory error. Reading the subgraphs from external storage causes massive disk I/Os.…”
Section: Costs Of Distributed Graph Processing Systemsmentioning
confidence: 99%
“…In-Memory Out-of-Core (use HDD if not indicated) In-Memory Out-of-Core In-Memory Out-of-Core Approaches Ligra [7] Galois [8] GraphMat [9] Polymer [10] GraphChi [31] X-Stream [32] VENUS [33] GridGraph [34] GraphMP (Use SSD) FlashGraph [35] TurboGraph [36] Medusa [11] Gunrock [13] MapGraph [14] gGraph [15] (Use SSD) GTS [16] GGraph [15] Pregel-like: [20] [21] [22] [23] [24] GAS: [25] [26] [27] SpMV: [28] [29] (Use HDD) GraphD [37] Chaos [38] Pregelix [ GraphChi, the edge-centric scatter-gather (ESG) model of X-Stream, the vertex-centric streamlined processing (VSP) model of VENUS, and the dual sliding windows (DSW) model of GridGraph. These approaches try to exploit the sequential bandwidth of hard disks and to reduce the amount of disk accesses.…”
Section: Single Machine (Cpu) Single Machine (Gpu) Cluster Data Storagementioning
confidence: 99%
“…After Pregel, many systems were proposed to optimize it. Giraph [21] is the open-source counterpart to Pregel; GraphX DBpedia 265 620 885 YAGO 651 1,822 2,473 Soc-Academia 187 479 666 Soc-YouTube 233 779 1,012 RoadNet-TX 2,287 3,039 5,326 RoadNet-CA 2,644 3,489 6,133 TABLE 2: Offline Performance (in min) [9] is Spark's graphs computing API, which also implements the Pregel operator; Pregel+ [25] proposes integration mirroring and message combining as well as a request-response mechanism; Quegel [28] extends the vertex-centric model to the query-centric model; TurboGraph++ [11] discusses how to process large graphs by exploiting external memory without compromising efficiency; GraphD [26] adopts a semistreaming model to avoid scanning the whole graph in each superstep, where only a portion of the vertex states are maintained in the main memory and edges and messages are streamed on the local disk; G-thinker [24] extends the vertexcentric model to the subgraph-centric model for computeintensive graph mining workloads.…”
Section: B Distributed Graph Processing Systemsmentioning
confidence: 99%
“…When graph models are used in an increasing number of applications, as one of the most classic problems in a graph, single-source shortest path length (SSSP length) queries have been studied for more than half a century and have received increasing attention [6], [16], [26]. Given a graph and a source vertex, an SSSP length query finds the distance from the source vertex to each vertex in the graph.…”
Section: Introductionmentioning
confidence: 99%