2020
DOI: 10.1007/978-3-030-48340-1_6

Optimizing Memory Bandwidth Efficiency with User-Preferred Kernel Merge

Cited by 1 publication (2 citation statements)
References 17 publications
“…First, we generated code versions with and without blocking for the Broadwell processor and the P100 GPU. An excerpt of results presented for different size of grids is shown in [29] and in Fig. 5a.…”
Section: Evaluating Blocking
Citation type: mentioning
confidence: 99%
“…To explore the benefit of kernel merging, experiments were done using the shallow water equation solver code on the NEC test system (for vector engines) and Piz Daint (for GPUs and CPUs) [29]. The performance of regular and merged kernels are shown in Table 3.…”
Section: Evaluating Inter-kernel Optimization
Citation type: mentioning
confidence: 99%