2017
DOI: 10.1016/j.combustflame.2017.02.005
|View full text |Cite
|
Sign up to set email alerts
|

An investigation of GPU-based stiff chemical kinetics integration methods

Abstract: A fifth-order implicit Runge-Kutta method and two fourth-order exponential integration methods equipped with Krylov subspace approximations were implemented for the GPU and paired with the analytical chemical kinetic Jacobian software pyJac. The performance of each algorithm was evaluated by integrating thermochemical state data sampled from stochastic partially stirred reactor simulations and compared with the commonly used CPU-based implicit integrator CVODE. We estimated that the implicit Runge-Kutta method… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 20 publications
(20 citation statements)
references
References 68 publications
(146 reference statements)
0
20
0
Order By: Relevance
“…The OpenMP code-generation is currently only capable of parallel execution, but extending this platform to shallow/deep-vectorizations (via loo.py and compiler #pragmas) is a key priority going forwards since OpenMP is a standard library on most machines. In addition, CUDA [51] has been significantly more reliable in previous works [18,24], while Intel's opensource OpenCL alternative, ISPC [78], has been relatively stable and easy to work with during preliminary efforts with the unit-testing discussed in Section 2.3. The current deep-vectorization formulation would be executable for both CUDA and ISPC targets, as these languages implement doubleprecision atomic operations, further recommending their use.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…The OpenMP code-generation is currently only capable of parallel execution, but extending this platform to shallow/deep-vectorizations (via loo.py and compiler #pragmas) is a key priority going forwards since OpenMP is a standard library on most machines. In addition, CUDA [51] has been significantly more reliable in previous works [18,24], while Intel's opensource OpenCL alternative, ISPC [78], has been relatively stable and easy to work with during preliminary efforts with the unit-testing discussed in Section 2.3. The current deep-vectorization formulation would be executable for both CUDA and ISPC targets, as these languages implement doubleprecision atomic operations, further recommending their use.…”
Section: Discussionmentioning
confidence: 99%
“…In addition,~9 × and~25 × speedups were achieved for the simulation of a partially premixed methanol flame for solving partial differential equations (PDE) on 16 CPU cores and the GPU, respectively. Curtis et al [24] implemented a fifth-order implicit Runge-Kutta method [44], as well as two fourth-order exponential integration techniques [45,46] paired with an analytical Jacobian code [18] on the GPU and CPU. The GPU-based implicit Runge-Kutta method performed equivalently to a standard implicit integrator [47] running on 12-38 CPU cores for two relatively small chemical models with an integration time step of 10 −6 s.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…A thorough performance analysis between GPUs and CPUs solving large numbers of ordinary differential equations is beyond the scope of the present study. The interested reader is referred to the publications [38,[50][51][52] for more details.…”
Section: Numerical Implementation and Parameter Choicementioning
confidence: 99%