This paper presents an end-to-end massively parallelized procedure for the solution of boundary value problems on Graphics Processing Units (GPU). The proposal is an integrated strategy that not only entails the calculation of nodal contributions, and the stiffness matrix assembly using the Meshless Local Petrov Galerkin Method (MLPG) but also the iterative solution of the system of algebraic equations in combination with methods from the Conjugate Gradient (CG) family. This end-to-end solution is fully developed using the Compute Unified Device Architecture (CUDA) platform without the need for extra data movement between the device and host after the matrix assembly. This is possible thanks to the parallel nature of the MLPG; each node designates a thread on the device. The introduced solution is wholly executed in the GPU, with minimal auxiliary structures or global synchronization points. The proposed approach was applied to the solution of a simple electromagnetic problem, and a sevenfold speedup was observed.
Modo de acesso: World Wide WebInclui bibliografia 1. Educação 2. Ensino Superior 3. Engenharia I. Título.
CDD-370O conteúdo dos artigos e seus dados em sua forma, correção e confiabilidade são de responsabilidade exclusiva dos seus respectivos autores www.poisson.com.br
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.