2009
DOI: 10.1002/nme.2776

An efficient coarse‐grained parallel algorithm for global–local multiscale computations on massively parallel systems

Abstract: The existing global-local multiscale computational methods, which use finite element discretization at both the macro-scale and the micro-scale, are intensive in terms of both computational time and memory, and their parallelization using domain decomposition methods incurs substantial communication overhead, limiting their application. We are interested in a class of explicit global-local multiscale methods whose architecture significantly reduces this communication overhead on massively parallel machines.…

Cited by 15 publications (10 citation statements); references 34 publications. All 10 citation statements are of the "mentioning" type, and the citing publications span 2011–2021.
“…The resulting Krylov subspace basis and upper Hessenberg matrix are used to initialize the preconditioner in Equation (55). The macroscale computation is performed sequentially while the microscale computations are performed in parallel using a logical hierarchical topology [38] on the IBM Blue Gene/L supercomputing platform at Rensselaer Polytechnic Institute's (RPI) Computational Center for Nanotechnology Innovations (CCNI).…”
Section: Methods (mentioning)
confidence: 99%
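For background, the Krylov subspace basis and upper Hessenberg matrix referred to in this statement are the standard outputs of an Arnoldi process. The following is a minimal NumPy sketch of that process, not the citing paper's implementation; the function name and breakdown tolerance are illustrative, and the preconditioner initialization of Equation (55) is not reproduced here.

```python
import numpy as np

def arnoldi(A, b, m):
    """Arnoldi process: build an orthonormal Krylov basis Q and the
    (m+1) x m upper Hessenberg matrix H with A @ Q[:, :m] == Q @ H."""
    Q = np.zeros((b.size, m + 1))
    H = np.zeros((m + 1, m))
    Q[:, 0] = b / np.linalg.norm(b)
    for j in range(m):
        v = A @ Q[:, j]
        for i in range(j + 1):            # modified Gram-Schmidt orthogonalization
            H[i, j] = Q[:, i] @ v
            v = v - H[i, j] * Q[:, i]
        H[j + 1, j] = np.linalg.norm(v)
        if H[j + 1, j] < 1e-12:           # happy breakdown: invariant subspace reached
            return Q[:, :j + 1], H[:j + 2, :j + 1]
        Q[:, j + 1] = v / H[j + 1, j]
    return Q, H
```

A GMRES-type solver minimizes the residual via a small least-squares problem with H; retaining Q and H between successive solves is one way such a basis can seed a preconditioner for later right-hand sides, which is the reuse the statement describes.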
“…Only the group leader (or master) can communicate with the masters of other groups. Rahul and Suvranu De [11] proved that this two-level grouping is superior to a naive parallelization in which each sub-domain is treated by one CPU. Figure 3 compares a simple parallelization without grouping and the two-level algorithm, with emphasis on the communication channel.…”
Section: Revisit to Two-Level Grouping Parallel Algorithm (mentioning)
confidence: 99%
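The masters-only communication pattern described above maps directly onto MPI communicators. Below is a minimal mpi4py sketch of such a two-level grouping, assuming a fixed group size; the group size, the reduced quantity, and the variable names are hypothetical, not taken from the cited papers.

```python
from mpi4py import MPI

GROUP_SIZE = 4  # hypothetical number of ranks per group

world = MPI.COMM_WORLD
rank = world.Get_rank()

# Level 1: partition all ranks into fixed-size groups.
group_id = rank // GROUP_SIZE
group_comm = world.Split(color=group_id, key=rank)
local_rank = group_comm.Get_rank()

# Level 2: only each group's leader (local rank 0) joins the
# inter-group communicator; other ranks pass MPI.UNDEFINED and
# receive MPI.COMM_NULL, so they never communicate across groups.
is_leader = local_rank == 0
leader_comm = world.Split(color=0 if is_leader else MPI.UNDEFINED, key=rank)

# Example exchange: each group reduces a local result to its leader,
# and only the leaders exchange the group-level results.
local_result = float(rank)  # stand-in for a subdomain quantity
group_sum = group_comm.reduce(local_result, op=MPI.SUM, root=0)
if is_leader:
    all_group_sums = leader_comm.allgather(group_sum)
```

Confining inter-group traffic to one leader per group replaces all-to-all messaging among every CPU with a much smaller exchange among masters, which is the communication saving the statement attributes to the two-level scheme.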
“…The issue of computational complexity is addressed using parallel implementation strategies, reduced order modeling at the coarse scale using high order (i.e., plate and shell) theories or reduced order modeling at the fine scales to efficiently evaluate the microscale response, as well as a combination of these three approaches. Parallelization of the computational homogenization [12,14,15] is natural and domain decomposition is readily applicable due to the local character of the microscale boundary value problems that are typically evaluated at the integration points of the macroscale grid. Model reduction at the coarse scale is achieved by exploiting the characteristics of the macroscopic domain.…”
Section: Introduction (mentioning)
confidence: 99%
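Because each microscale boundary value problem is attached to a single integration point of the macroscale grid, the microscale solves described above are independent and can be distributed with no communication between them. A minimal Python sketch under that assumption, where solve_microscale_bvp is a hypothetical stand-in for an actual RVE solver:

```python
from concurrent.futures import ProcessPoolExecutor

import numpy as np

def solve_microscale_bvp(macro_strain):
    """Hypothetical placeholder for one microscale (RVE) solve:
    returns a homogenized stress for the given macroscale strain."""
    C_eff = np.eye(macro_strain.size)  # placeholder effective stiffness
    return C_eff @ macro_strain

def homogenized_stresses(macro_strains):
    # One independent microscale problem per integration point,
    # so a plain parallel map suffices (embarrassingly parallel).
    with ProcessPoolExecutor() as pool:
        return list(pool.map(solve_microscale_bvp, macro_strains))

if __name__ == "__main__":
    strains = [np.array([1e-3, 0.0, 0.0]) for _ in range(8)]
    stresses = homogenized_stresses(strains)
```

This locality is why the statement calls domain decomposition "readily applicable" here: the only synchronization point is gathering the homogenized responses back to the macroscale solve.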