“…To list a few of the CUDAaccelerated CFD applications, Elsen et al 11 reported a 3D high-order FDM solver for large calculation on multi-block structured grids; Klöckner et al 16 developed a 3D unstructured high-order nodal DGM solver for the Maxwell's equations; Corrigan et al 10 proposed a 3D FVM solver for compressible inviscid flows on unstructured tetrahedral grids; Zimmerman et al 29 presented an SDM solver for the Navier-Stokes equations on unstructured hexahedral grids; and more as in the references. 3,4,12,7,21,23,13,19,8,14,1,9 However applying CUDA to a legacy CFD code is not likely an easy job since the developer has to define an explicit layout of the threads on the GPU (numbers of blocks, numbers of threads) for each kernel function. 15 So what if the CFD code designers have to meet specific investment requirements like (1) enable GPU computing for legacy CFD programs at a minimum extra cost in time and effort (usually a major concern for large-scale code development), (2) enable the GPU-accelerated programs running on different platforms (similar to the situation that the video game designers would like to make their products available across platforms)?…”