2016
DOI: 10.1080/10618562.2016.1164309
|View full text |Cite
|
Sign up to set email alerts
|

Implementation and efficiency analysis of parallel computation using OpenACC: a case study using flow field simulations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…Since the explicit time-stepping procedure is adopted, and within a single time step the numerical mesh calculation is uncorrelated. Therefore, the loop computation on each grid can be performed in parallel (Zhang et al 2017). In this study, the hybrid MPI/OpenACC is applied to leverage multiple GPUs on a single node.…”
Section: Parallel Implementationmentioning
confidence: 99%
See 1 more Smart Citation
“…Since the explicit time-stepping procedure is adopted, and within a single time step the numerical mesh calculation is uncorrelated. Therefore, the loop computation on each grid can be performed in parallel (Zhang et al 2017). In this study, the hybrid MPI/OpenACC is applied to leverage multiple GPUs on a single node.…”
Section: Parallel Implementationmentioning
confidence: 99%
“…Herdman et al (2012) Clover-Leaf mini application accelerated with the OpenACC programming model, achieving a 4.91 speedup with the NVIDIA X2090 GPU over a 16-core CPU. Zhang et al (2016) the flow field case was performed using an OpenACC parallel computing application, with a speedup of 18.6 achieved by employing the NVIDIA Quadro K2000 GPU compared to a 4-core CPU. Zhang et al (2017) proposed a 2D parallel dam break model using OpenACC applications and obtained 20.7 speedups using the NVIDIA Tesla K20c GPU versus a 4-core CPU.…”
Section: Introductionmentioning
confidence: 99%
“…Lamb et al [30] reported a speedup factor of 112× for a GPU code run on a NVIDIA GeForce 8800GTX over the serial JFLOW code. Zhang et al [31,32] applied a GPU-based parallel method, OpenACC, to the parallel calculation of the flow field and the dam-break model. Rueda et al [33] performed parallel experiments on OpenACC and CUDA based on an algorithm for simulating flood storage area and on the DEM dataset of Africa.…”
Section: Introductionmentioning
confidence: 99%