2008
DOI: 10.1007/s00791-008-0095-z
|View full text |Cite
|
Sign up to set email alerts
|

Distributed $${{\mathcal H}^2}$$ -matrices for non-local operators

Abstract: H 2 -matrices can be used to approximate dense n × n matrices resulting from the discretization of certain non-local operators (e.g., Fredholm-type integral operators) in O(nk) units of storage, where k is a parameter controlling the accuracy of the approximation. Since typically k n holds, this representation is much more efficient than the conventional representation by a two-dimensional array. For very large problem dimensions, the amount of available storage becomes a limiting factor for practical algorith… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…For the 2D tests, the problem size is 2 20 . With P = 8 GPUs, the local problem size is only p N = 2 17 and results in an efficiency reduction to near 50%. On 32 GPUs, the local problem size is p N = 2 15 and the limit of strong scalability is essentially reached: there is very little local work to do and the whole operation takes a few ms, spent mostly in communications.…”
Section: Strong Scalabilitymentioning
confidence: 99%
See 1 more Smart Citation
“…For the 2D tests, the problem size is 2 20 . With P = 8 GPUs, the local problem size is only p N = 2 17 and results in an efficiency reduction to near 50%. On 32 GPUs, the local problem size is p N = 2 15 and the limit of strong scalability is essentially reached: there is very little local work to do and the whole operation takes a few ms, spent mostly in communications.…”
Section: Strong Scalabilitymentioning
confidence: 99%
“…A high quality software for distributed-memory environments that targets large scale problems is STRUMPACK [5]. Distributed memory algorithms for constructing and operating on these matrices were proposed in [17,37,50]. Dynamic run-time systems to manage the scheduling of the operations of the irregular hierarchical matrix structure in a more convenient fashion through explicit task graphs are presented in [7,18].…”
Section: Introductionmentioning
confidence: 99%
“…For H 2 -matrices, which are special variants of H-matrices, a parallel algorithm on distributed memory clusters was proposed in Ref. [17]. The algorithm was implemented using flat-MPI approach, and the parallel scalability was also evaluated on a small cluster system with 16 single-CPU machines.…”
Section: ) To O(n) ∼ O(n Log N)mentioning
confidence: 99%