2014 IEEE International Parallel &Amp; Distributed Processing Symposium Workshops 2014
DOI: 10.1109/ipdpsw.2014.109
|View full text |Cite
|
Sign up to set email alerts
|

Dynamically Balanced Synchronization-Avoiding LU Factorization with Multicore and GPUs

Abstract: Graphics processing units (GPUs) brought huge performance improvements in the scientific and numerical fields. We present an efficient hybrid CPU/GPU computing approach that is portable, dynamically and efficiently balances the workload between the CPUs and the GPUs, and avoids data transfer bottlenecks that are frequently present in numerical algorithms. Our approach determines the amount of initial work to assign to the CPUs before the execution, and then dynamically balances workloads during the execution. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 17 publications
0
8
0
Order By: Relevance
“…After computation of execution time, we computed performance of saxpy in GFLOPs using eq (5). The Fig.7, plots comparison of pageable and pinned memory for saxpy applications on a hybrid programming model.…”
Section: Results Pinned and Pageable Data Transfer For Saxpy Applimentioning
confidence: 99%
See 3 more Smart Citations
“…After computation of execution time, we computed performance of saxpy in GFLOPs using eq (5). The Fig.7, plots comparison of pageable and pinned memory for saxpy applications on a hybrid programming model.…”
Section: Results Pinned and Pageable Data Transfer For Saxpy Applimentioning
confidence: 99%
“…Simplice Donfack et.al [5] present effective hybrid CPU/GPU approaches that is portable. It dynamically and efficiently balances the workload between the CPUs and the GPU.…”
Section: Np Karunadasa and D N Ranasingh [1] Had Demonstratedmentioning
confidence: 99%
See 2 more Smart Citations
“…Another auto-tuned library for this platform is MAGMA MIC [31], which uses a hybridization methodology, where the algorithms are split into computational tasks of varying granularity and, then, they are properly scheduled over the heterogeneous hardware. In [32] an auto-tuning method for the LU factorization is described. In this proposal, the initial amount of work to assign to both the CPU and GPU is determined before execution by means of an analytical model of the execution time, and the workload is dynamically balanced during the execution.…”
Section: Linear Algebra On Hybrid Platformsmentioning
confidence: 99%