2012
DOI: 10.1109/tmag.2011.2177814
|View full text |Cite
|
Sign up to set email alerts
|

Power Aware Parallel 3-D Finite Element Mesh Refinement Performance Modeling and Analysis With CUDA/MPI on GPU and Multi-Core Architecture

Abstract: Software power performance tuning handles the critical design constraints of software running on hardware platforms composed of large numbers of power-hungry components. The power dissipation of a Single Program/Instruction Multiple Data (SPMD/SIMD) computation such as finite element method (FEM) mesh refinement is highly dependent on the underlying algorithm and the power-consuming features of hardware Processing Elements (PE). This contribution presents a practical methodology for modeling and analyzing the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 3 publications
0
9
0
Order By: Relevance
“…These values are important, mainly considering the 3D nontrivial domains, which were chosen to test our proposal. Therefore, unless the parallel computing resources are used (Bracken et al, 2012), neither the topological domain complexity, nor the mesh refinements and time processing performance of the examples discussed in relevant studies (Li et al, 2013;Chen and Biro, 2012;Zhao et al, 2012;Ho et al, 2011;Zhang and Kumar, 2011;Krebs et al, 2010) surpass the equivalent features of the test cases presented in this study, which were explored using one CPU core. Moreover, the approach described is able to provide highly quality meshes with common and very accessible hardware resources.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…These values are important, mainly considering the 3D nontrivial domains, which were chosen to test our proposal. Therefore, unless the parallel computing resources are used (Bracken et al, 2012), neither the topological domain complexity, nor the mesh refinements and time processing performance of the examples discussed in relevant studies (Li et al, 2013;Chen and Biro, 2012;Zhao et al, 2012;Ho et al, 2011;Zhang and Kumar, 2011;Krebs et al, 2010) surpass the equivalent features of the test cases presented in this study, which were explored using one CPU core. Moreover, the approach described is able to provide highly quality meshes with common and very accessible hardware resources.…”
Section: Resultsmentioning
confidence: 99%
“…The proposed method was tested considering solids with complex geometries: "C-type" magnet, turbocompressor device, Klein bottle and an electrical motor with supplementary permanent magnets. These choices were motivated by the several studies focused on computational electromagnetics considering similar geometries (Li et al, 2016;Chen and Biro, 2012;Wall et al, 2012;Zhao et al, 2012;Bracken et al, 2012;Zhang and Kumar, 2011;Ho et al, 2011;Chang et al, 2010;Jang et al, 2007;Cho et al, 2006).…”
Section: Application Contextmentioning
confidence: 99%
See 2 more Smart Citations
“…Kakay et al implement their finite element micromagnetic simulation in the GPU, demonstrating a high speed performance compared to CPU multi‐core implementation. Ren et al model and analyze power performance of parallel 3D finite element mesh refinement on CUDA and Message Passing Interface (MPI) architecture using multi‐core CPU and GPU cluster, also proposing parallelization techniques for both. In recent works on FEM, Markall et al use finite element advection‐diffusion solver to demonstrate that FEM implementations on many‐core (GPU) and multi‐core (CPU) architectures differ if their performance potential is to be obtained.…”
Section: Related Work On Graphics Processor Units and High Performancmentioning
confidence: 99%