2020 IEEE/ACM 3rd Annual Parallel Applications Workshop: Alternatives to MPI+X (PAW-ATM) 2020
DOI: 10.1109/pawatm51920.2020.00006

Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs

Abstract: Getting performance on high-end heterogeneous nodes is challenging. This is due to the large semantic gap between a computation's specification (possibly mathematical formulas or an abstract sequential algorithm) and its parallel implementation; this gap obscures the program's parallel structures and how it gains or loses performance. We present Hedgehog, a library aimed at coarse-grain parallelism. It explicitly embeds a data-flow graph in a program and uses this graph at runtime to drive the program's executio…

Citations: cited by 3 publications (1 citation statement)
References: 15 publications
“…This state identifies when a partial result block can be emitted so it can be used in following operations. Our implementations use the data flow library, Hedgehog [5], which requires a developer to explicitly represent an algorithm's state and makes it easy to reason about this state. The implementations of both algorithms use the block streaming design pattern described in Section 1 and invoke BLAS functions from the OpenBLAS [6] library to operate on blocks.…”
Section: Algorithm Development (mentioning)
confidence: 99%