Optimization of Finite-Differencing Kernels for Numerical Relativity Applications

Alfieri, Roberto; Bernuzzi, Sebastiano; Perego, Albino; Radice, David

doi:10.3390/jlpea8020015

Cited by 4 publications

(3 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Generic spatial field derivatives in the bulk (away from ∂Ω) are computed with high-order, centered, finite difference (FD) stencils whereas shift advection terms use stencils lopsided by one grid point Zlochower et al (2005); Husa et al (2008);Brügmann et al (2008); Chirvasa & Husa (2010). The implementation is based on Alfieri et al (2018) and utilizes C++ templates to offer flexibility in problem-specific accuracy demands without performance penalties. A similar approach is taken for implementation of the R and P operators discussed in §2.2.3.…”

Section: Numerical Techniquementioning

confidence: 99%

“…These attractive properties served as a strong motivation in development of GR-Athena++ where we have implemented the Z4c formulation Bernuzzi & Hilditch (2010); Ruiz et al (2011); Weyhausen et al (2012); Hilditch et al (2013) of NR utilizing the (moving) puncture gauge Brandt & Brügmann (1997); Baker et al (2007); Campanelli et al (2006). We provide accurate and efficient extensions to derivative approximants through (templated) arbitrary-order FD based on Alfieri et al (2018). Our introduction of vertex-centered (VC) variable treatment (extending core cell-and facecentered functionality) is motivated by a desire to match any selected FD order in calculations that involve AMR.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

GRAthena++: puncture evolutions on vertex-centered oct-tree AMR

Daszuta,

Zappa,

Cook

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Numerical relativity is central to the investigation of astrophysical sources in the dynamical and strong-field gravity regime, such as binary black hole and neutron star coalescences. Current challenges set by gravitational-wave and multi-messenger astronomy call for highly performant and scalable codes on modern massively-parallel architectures. We present GR-Athena++, a general-relativistic, high-order, vertex-centered solver that extends the oct-tree, adaptive mesh refinement capabilities of the astrophysical (radiation) magnetohydrodynamics code Athena++. To simulate dynamical spacetimes GR-Athena++ uses the Z4c evolution scheme of numerical relativity coupled to the moving puncture gauge. We demonstrate stable and accurate binary black hole merger evolutions via extensive convergence testing, cross-code validation, and verification against state-of-the-art effective-one-body waveforms. GR-Athena++ leverages the task-based parallelism paradigm of Athena++ to achieve excellent scalability. We measure strong scaling efficiencies above 95% for up to ∼ 1.2 × 10 4 CPUs and excellent weak scaling is shown up to ∼ 10 5 CPUs in a production binary black hole setup with adaptive mesh refinement. GR-Athena++ thus allows for the robust simulation of compact binary coalescences and offers a viable path towards numerical relativity at exascale.

show abstract

Section: Numerical Techniquementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

GRAthena++: puncture evolutions on vertex-centered oct-tree AMR

Daszuta,

Zappa,

Cook

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…The article [16] is devoted to a method of multithreading optimization using OpenMP applied for two problems: wave equation and linearized Einstein equations. The presented pictures show good speedup scaling.…”

Section: Related Workmentioning

confidence: 99%

Multithreaded Acceleration of 3D Mathematical Model for Ore Sintering

Krasnikov¹

2021

IJC

View full text Add to dashboard Cite

One of the widely used methods to accelerate a numerical solver is implementation of multithreading. The problem of thread allocation on-demand at runtime is latency, caused by periodical instantiation of threads. The article is devoted to parallelization of solver for 3D mathematical model of ore sintering, based on software threads reusing them during computation. Computational domain is equally shared among available threads. Each thread writes only to own data partition. A looped barrier is proposed for guaranteed synchronization of all threads after iteration. The method allows scaling performance without recompilation of the solver by using similar CPU with more cores. Measurement of solver performance with 220 nodes using different thread count confirms scalability around 95% for double and single precision arithmetics. Presented pictures of perspective view with three slices of temperature field show influence of heat loss from pallets walls. A cross section of temperature field in layer after 16 minutes of sintering is calculated with appearance of two high-temperature regions inside. Comparison of temperature field with literature data gives good correspondence. The computer model takes into account important chemical reactions, such as, coke burning, carbonate dissolution, water vaporization, as well as mass-heat transfer inside the sinter layer and can be used in metallurgical plants to increase effectiveness of sintering.

show abstract