2015
DOI: 10.1049/el.2015.2175

Efficient out‐of‐GPU memory strategies for solving matrix equation generated by method of moments

Abstract: The numerical solution of the dense, complex-valued linear system of equations generated by the method of moments (MoM) generally proceeds by computing an LU factorisation of the impedance matrix. Depending on the available hardware resources, the LU algorithm can be executed on either sequential or parallel computers. A straightforward parallel implementation of LU factorisation does not yield a well-distributed workload, and it is therefore the computationally most expensive step of the MoM process, especially w…

Cited by 7 publications (4 citation statements)
References 7 publications
“…The sequential CPU procedures for computing the elements of the system matrix have been mapped to a parallel GPU platform as described in [20]. To enable the solution of relatively large problems whose MoM-generated matrices exceed the amount of memory available on the device, a hybrid out-of-GPU-memory, CULA-panel-based LU decomposition algorithm has been implemented [21]. The algorithm proceeds iteratively through two distinct phases: i) panel factorisation, and ii) update of the trailing submatrix.…”
Section: Hybrid CPU/GPU Implementation
confidence: 99%
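The two-phase iteration described in this citation statement is the classical right-looking blocked (panel-based) LU algorithm. The following NumPy sketch illustrates the structure of those two phases only; it is not the cited hybrid implementation (which streams the trailing-submatrix update to the GPU via CULA), and for brevity it omits pivoting, so it assumes a matrix with non-vanishing pivots (e.g. diagonally dominant).

```python
import numpy as np

def blocked_lu(A, nb=32):
    """Right-looking blocked (panel-based) LU, in place, no pivoting.

    Illustrative sketch only: in the hybrid out-of-GPU-memory scheme
    the trailing-submatrix update would be offloaded to the device in
    tiles that fit GPU memory; here everything stays in host memory.
    """
    n = A.shape[0]
    for k in range(0, n, nb):
        kb = min(nb, n - k)
        # Phase i) panel factorisation: unblocked LU of the nb-column panel
        for j in range(k, k + kb):
            A[j+1:, j] /= A[j, j]
            A[j+1:, j+1:k+kb] -= np.outer(A[j+1:, j], A[j, j+1:k+kb])
        if k + kb < n:
            # Triangular solve for the block row: U12 = L11^{-1} A12
            L11 = np.tril(A[k:k+kb, k:k+kb], -1) + np.eye(kb)
            A[k:k+kb, k+kb:] = np.linalg.solve(L11, A[k:k+kb, k+kb:])
            # Phase ii) trailing-submatrix update: A22 -= L21 @ U12
            # (a GEMM, the part offloaded tile by tile to the GPU)
            A[k+kb:, k+kb:] -= A[k+kb:, k:k+kb] @ A[k:k+kb, k+kb:]
    return A
```

Because phase ii) is a large matrix-matrix product, it dominates the flop count and is the natural candidate for GPU offload, while the narrow panel of phase i) can remain on the host.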
“…1 explains how the CPU/GPU computations are organized. The interested reader is referred to [20], [21], and [23] for more details.…”
Section: Hybrid CPU/GPU Implementation
confidence: 99%
“…During the last decades, much effort has been devoted to developing numerical methods for solving matrix equations. Yao et al studied the solutions of the matrix equation AX = B with respect to the semi-tensor product. In the work of Chiang, some useful results on the solvability of the Sylvester-like matrix equation AX + f ( X ) B = C were first presented by using the Kronecker product, and the closed-form solutions of this matrix equation were then provided.…”
Section: Introduction
confidence: 99%
“…Compared with a single core, a speedup of 15 can be obtained. In [20], an efficient out-of-GPU-memory scheme for solving matrix equations generated by the MoM was presented, which also achieves a good speedup on a single CPU/GPU platform. In [21,22], an out-of-core scheme between RAM and hard-disk drives (HDDs), using RWG and higher-order basis functions (HOBs), was adopted to break the RAM limitation and improve the capability of parallel MoM to solve larger EM problems.…”
Section: Introduction
confidence: 99%
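The out-of-core idea referenced here (keeping the full matrix on disk and bringing only one panel into RAM at a time) can be illustrated on the simpler matrix-vector product. This sketch is not the scheme of [21,22]; it only shows the generic panel-streaming pattern, using `numpy.memmap` as a stand-in for explicit HDD I/O, with a hypothetical helper name.

```python
import numpy as np

def out_of_core_matvec(path, n, x, nb=256):
    """y = A @ x where A is stored on disk and read one nb-column
    panel at a time, so at most n*nb matrix entries are resident
    in RAM at once.

    Illustrative pattern only (hypothetical helper): out-of-core
    LU solvers stream panels between RAM and HDD in the same way.
    """
    A = np.memmap(path, dtype=np.float64, mode='r', shape=(n, n))
    y = np.zeros(n)
    for k in range(0, n, nb):
        panel = np.asarray(A[:, k:k+nb])   # load one panel into RAM
        y += panel @ x[k:k+nb]             # accumulate its contribution
    return y
```

The working-set size is controlled by `nb`, which plays the same role as the panel width in the blocked LU: it trades I/O traffic against RAM footprint.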