2020
DOI: 10.1007/978-3-030-43229-4_10
Parallel Performance of an Iterative Solver Based on the Golub-Kahan Bidiagonalization

Abstract: We present a scalability study of Golub-Kahan bidiagonalization for the parallel iterative solution of symmetric indefinite linear systems with a 2 × 2 block structure. The algorithms have been implemented within the parallel numerical library PETSc. Since a nested inner-outer iteration strategy may be necessary, we investigate different choices for the inner solvers, including parallel sparse direct and multigrid accelerated iterative methods. We show the strong and weak scalability of the Golub-Kahan bidiagonalization…
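The paper's Algorithm 1 is not reproduced on this page. Purely as an illustration of the structure described in the abstract, the following is a minimal dense NumPy sketch of a generalized Golub-Kahan bidiagonalization (Craig's variant) for a 2 × 2 block saddle-point system. The function name gkb_saddle_point, the choice of inner product N = I, and the simple coefficient-based stopping test are assumptions made for this sketch; in the parallel PETSc setting every application of M^{-1} would instead be an inner solve (sparse direct or multigrid accelerated), which is exactly where the nested inner-outer strategy enters.

import numpy as np

def gkb_saddle_point(M, A, f, g, tol=1e-10, maxit=200):
    # Solve [M A; A^T 0] [u; p] = [f; g] with M symmetric positive definite
    # and A of full column rank, using a generalized Golub-Kahan
    # bidiagonalization (Craig's variant).  Dense illustration only:
    # M is inverted once, whereas a parallel implementation would call an
    # inner (direct or multigrid) solver for every application of M^{-1}.
    Minv = np.linalg.inv(M)

    # Shift the right-hand side so that the transformed system reads
    #   M w + A p = 0,  A^T w = g_hat,   with  u = M^{-1} f + w.
    u0 = Minv @ f
    g_hat = g - A.T @ u0

    # Initialization of the bidiagonalization (inner product N = I).
    beta = np.linalg.norm(g_hat)
    if beta == 0.0:                        # u = M^{-1} f, p = 0 already solves the system
        return u0, np.zeros(A.shape[1])
    v = g_hat / beta
    t = Minv @ (A @ v)
    alpha = np.sqrt(t @ (M @ t))           # M-norm of t
    q = t / alpha                          # first M-orthonormal direction

    z = beta / alpha                       # expansion coefficient z_1
    d = v / alpha
    w = z * q                              # approximation of the shifted unknown
    p = -z * d

    for _ in range(1, maxit):
        s = A.T @ q - alpha * v
        beta = np.linalg.norm(s)
        if beta == 0.0:                    # breakdown: constraint satisfied exactly
            break
        v = s / beta
        t = Minv @ (A @ v) - beta * q
        alpha = np.sqrt(t @ (M @ t))
        q = t / alpha
        # Craig-type updates of the coefficients and iterates.
        z = -(beta / alpha) * z
        d = (v - beta * d) / alpha
        w = w + z * q
        p = p - z * d
        # Crude surrogate for the energy-norm stopping test of the paper:
        # stop once the newest coefficient is small relative to ||w||_M.
        if abs(z) <= tol * np.sqrt(w @ (M @ w)):
            break

    return u0 + w, p

For a quick sanity check one can take a small SPD M, a random A of shape (n, m) with n > m and full column rank, call u, p = gkb_saddle_point(M, A, f, g), and compare against np.linalg.solve applied to the assembled block matrix.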

Cited by 5 publications (6 citation statements)
References 24 publications
“…We observe that the speed-up reaches half of the ideal speed-up. Such a result is consistent with what we observed in [14].…”
Section: Strong Scaling (supporting)
Confidence: 93%
“…Here, we focus on the aforementioned Craig's variant algorithm for the solution of saddle point systems, which is presented in Algorithm 1. As a stopping criterion, we use a normalized lower bound estimate of the energy norm error e_k := ‖u − u_k‖_M described in [13,14]. The algorithm stops once this normalized lower bound undershoots a sufficiently small tolerance τ.…”
Section: The Algorithm (mentioning)
Confidence: 99%
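For orientation only (the precise bound and normalization are given in [13,14] and are not reproduced on this page), the kind of estimate quoted above can be sketched as follows, assuming the iterate u_k is expanded in M-orthonormal directions with coefficients z_j and using a delay parameter d:

\|u - u_k\|_M^2 \;=\; \sum_{j>k} z_j^2 \;\ge\; \sum_{j=k+1}^{k+d} z_j^2 \;=:\; \xi_{k,d}^2 .

The partial sum \xi_{k,d}^2 is computable at iteration k+d and bounds the squared energy-norm error at iteration k from below; normalizing it (for instance by an estimate of \|u_k\|_M^2) and comparing against \tau^2 yields a stopping test of the type described in the quotation. The symbols z_j, d, and \xi_{k,d} are notation introduced here for this sketch.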
“…As the reactor containment building test case from EDF was not yet available for the strong and weak scalability investigations, we have adapted Stokes sample problems provided in the PETSc distribution. In this work, we focus on extending and improving the parallel results presented in our conference contribution [8] with the following novel aspects:
- We ran the examples on a larger number of cores, that is, 1024, and we studied the scalability of a bigger matrix for the Poiseuille flow test case (m ≈ 16.8·10^6, n ≈ 8.4·10^6).
- We introduce a three-dimensional Stokes example and we comment on its strong scalability.
- We discuss the weak scalability of the nested inner-outer iterative variants of our solver on the three test cases in two and three dimensions.
- By linking with the Intel Math Kernel Library (MKL) for executing dense linear algebra operations, we improve the previously obtained computation times, especially those for the employed parallel sparse direct solver MUMPS [9] (Multifrontal Massively Parallel sparse direct Solver). We obtained, for example, a speed-up by a factor of more than 5 for the standalone MUMPS solver and of about 3 for GKB-MUMPS for computations on two cores.
- We investigate the portability and present the performance of the algorithm on an Advanced Micro Devices (AMD) architecture.…”
Section: Introduction (mentioning)
Confidence: 99%