Abstract:This paper describes a parallel implementation of the discontinuous Galerkin method. The discontinuous Galerkin is a spatially compact method that retains its accuracy and robustness on non-smooth unstructured grids and is well suited for time dependent simulations. Several parallelization approaches are studied and evaluated. The most natural and symmetric of the approaches has been implemented in an object-oriented code used to simulate aeroacoustic scattering. The parallel implementation is MPI-based and ha… Show more
“…The super linear speed up in Fig. 3 can be attributed to the cache effect [6]. Speed up from the parallelization of the overall algorithm versus different number of processors will be presented in the conference.…”
Section: Mpi Parallelization Of the Modelmentioning
“…The super linear speed up in Fig. 3 can be attributed to the cache effect [6]. Speed up from the parallelization of the overall algorithm versus different number of processors will be presented in the conference.…”
Section: Mpi Parallelization Of the Modelmentioning
“…We have allowed the computation of auxiliary diagnostic variables between the call of send and receive. Further code revision could include the computation of the right-hand side and internal flux computations as described in Baggag et al (1999). In Algorithm 3 we describe how we overlap the computation of auxiliary variables and the computation of the gradient of the solution for the diffusion operator with the communication of the fluxes.…”
Section: Overlapping For the Dg Methodsmentioning
confidence: 99%
“…This allows the computation of internal edge and element integra communication. We have allowed the computation of auxiliary diagnostic variables between th Further code revision could include the computation of the right hand side and internal flux co (Baggag et al (1999)). In Algorithm 3 we describe how we overlap the computation of auxiliary va 8 Most notable about the implementation explained above is that, even in the absence of additional computation to be completed during communication, the packing and unpacking of the buffers provides some data movement to be accomplished while waiting for messages to be received.…”
Abstract. The scalability of computational applications on current and next-generation supercomputers is increasingly limited by the cost of inter-process communication. We implement non-blocking asynchronous communication in the High-Order Methods Modeling Environment for the time integration of the hydrostatic fluid equations using both the spectral-element and discontinuous Galerkin methods. This allows the overlap of computation with communication, effectively hiding some of the costs of communication. A novel detail about our approach is that it provides some data movement to be performed during the asynchronous communication even in the absence of other computations. This method produces significant performance and scalability gains in large-scale simulations.
“…Moreover, for large simulation in 3D, the use of supercomputers and scalable parallel algorithms is indispensable. Early research on the parallel implementation of DG for various applications can be found in [2,8,43,7].…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.