2020
DOI: 10.7717/peerj-cs.247

Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering

Abstract: The task-based approach is a parallelization paradigm in which an algorithm is transformed into a directed acyclic graph of tasks: the vertices are computational elements extracted from the original algorithm and the edges are the dependencies between them. During execution, managing the dependencies adds an overhead that can become significant when the computational cost of the tasks is low. One way to reduce the makespan is to aggregate the tasks to make them heavier, while having fewer of them…
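To make the aggregation idea concrete, here is a minimal Python sketch of the constraint named in the title: after tasks are grouped into clusters, the cluster-level graph must still be acyclic, otherwise two clusters would each wait on the other. This is not the paper's clustering algorithm. The helpers `contract` and `is_acyclic`, and the toy diamond graph, are hypothetical names chosen for illustration only.

```python
from collections import defaultdict, deque


def contract(edges, cluster_of):
    """Build the cluster-level graph: one node per cluster (hypothetical helper)."""
    quotient = defaultdict(set)
    for u, v in edges:
        cu, cv = cluster_of[u], cluster_of[v]
        quotient[cu]  # touch both keys so isolated clusters still appear as nodes
        quotient[cv]
        if cu != cv:  # edges inside a cluster disappear after aggregation
            quotient[cu].add(cv)
    return dict(quotient)


def is_acyclic(graph):
    """Kahn's algorithm: the graph is acyclic iff every node can be scheduled."""
    indegree = {node: 0 for node in graph}
    for successors in graph.values():
        for s in successors:
            indegree[s] += 1
    ready = deque(node for node, d in indegree.items() if d == 0)
    scheduled = 0
    while ready:
        node = ready.popleft()
        scheduled += 1
        for s in graph[node]:
            indegree[s] -= 1
            if indegree[s] == 0:
                ready.append(s)
    return scheduled == len(graph)


# Toy diamond-shaped task graph (assumed for illustration): a -> b, a -> c, b -> d, c -> d
edges = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")]

# Grouping {a, b} and {c, d} leaves a single edge between the two clusters.
valid_clustering = {"a": 0, "b": 0, "c": 1, "d": 1}

# Grouping {a, d} creates cluster-level cycles through b and c.
invalid_clustering = {"a": 0, "d": 0, "b": 1, "c": 2}
```

Calling `is_acyclic(contract(edges, valid_clustering))` accepts the first grouping, while the same call with `invalid_clustering` rejects the second, because the cluster holding `a` and `d` both precedes and follows the clusters holding `b` and `c`.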

Cited by 10 publications (5 citation statements)
References: 33 publications
“…The order in which the conflicts are resolved is either in the decreasing (greedy max) or increasing (greedy min) order based on their cumulative weight. The source code for the evaluation and the generation of the figures is available online [7].…”
Section: Discussion (mentioning)
confidence: 99%
“…Several algorithms have been proposed for scheduling DAGs [43], [44], [45], which use either list-based or clustering-based scheduling heuristics, while GRAPHOPT takes a different approach of modeling the core routine of the tool as a constrained-optimization problem, allowing the use of open-source solvers. The constrained-optimization-based approach is explored in [46], [47].…”
Section: Graph Partitioning (mentioning)
confidence: 99%
“…We have tested several implementations. For more details on the implementation, we refer for instance to [2], where the algorithm is parallelized with the StarPU runtime, or to [5] where we use a specialized DAG (Direct Acyclic Graph) clustering algorithm. The DAG algorithm relies on an OpenMP implementation.…”
Section: Poloidal Parallelism: Task Graph (mentioning)
confidence: 99%
“…The resulting scheme has also nice parallelization possibilities. In a poloidal plane, the block-triangular linear systems resulting from the DG scheme that are well solved by an optimized task-based implementation [2,5]. In the toroidal direction, the transport equations are solved by a simple shift operator.…”
Section: Introduction (mentioning)
confidence: 99%