2021
DOI: 10.48550/arxiv.2103.13577
Preprint

ButterFly BFS -- An Efficient Communication Pattern for Multi Node Traversals

Abstract: Breadth-First Search (BFS) is a building block used in a wide array of graph analytics and is used in various network analysis domains: social, road, transportation, communication, and much more. Over the last two decades, network sizes have continued to grow. The popularity of BFS has brought with it a need for significantly faster traversals. Thus, BFS algorithms have been designed to exploit shared-memory and shared-nothing systems; this includes algorithms for accelerators such as the GPU. GPUs offer extrem…

Cited by 2 publications (2 citation statements)
References 36 publications

“…Note however that standard GPU systems that rely on PCIe interconnects rather than switched NVLINK are far less competitive. A recent preprint [33] claimed a performance of more than 300 GTEPS for a large Kronecker Graph on 16 V100 GPUs in a DGX-2 system, but at the time of this writing, the code was not publicly available.…”
Section: Discussion (mentioning)
confidence: 99%
“…Therefore, algorithms must find a static partitioning and communication scheme, or do redistribution using e.g. butterfly communication patterns [27]. Many-to-many sequence alignments can be modeled as a graph problem and benefit from sharing sequences between different partitions without additional host-to-device transfers [28].…”
Section: G IPU-specific Challenges (mentioning)
confidence: 99%