2017
DOI: 10.1007/978-3-319-58667-0_2
|View full text |Cite
|
Sign up to set email alerts
|

Tile Low Rank Cholesky Factorization for Climate/Weather Modeling Applications on Manycore Architectures

Abstract: Covariance matrices are ubiquitous in computational science and engineering. In particular, large covariance matrices arise from multivariate spatial data sets, for instance, in climate/weather modeling applications to improve prediction using statistical methods and spatial data. One of the most time-consuming computational steps consists in calculating the Cholesky factorization of the symmetric, positive-definite covariance matrix problem. The structure of such covariance matrices is also often data-sparse,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
53
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 42 publications
(53 citation statements)
references
References 19 publications
0
53
0
Order By: Relevance
“…Block Low-Rank compression has been investigated for dense matrices [18,19], and for sparse linear systems when using a multifrontal method [20,21]. Considering that these approaches are similar to the current study, a detailed comparison will be described in Section 6.…”
Section: Introductionmentioning
confidence: 93%
“…Block Low-Rank compression has been investigated for dense matrices [18,19], and for sparse linear systems when using a multifrontal method [20,21]. Considering that these approaches are similar to the current study, a detailed comparison will be described in Section 6.…”
Section: Introductionmentioning
confidence: 93%
“…More recently, with the emergence of asynchronous task-based programming models, these hierarchical low-rank matrix approximations algorithms have been revisited by flattening their recursions and exposing them to task-based runtime systems such as Intel Threading Building Blocks (Intel TBB) [22] and OpenMP [3]. While these dynamic runtimes permit to mitigate the overhead from the bus bandwidth saturation on single shared-memory nodes, they do not support distributed-memory systems.…”
Section: Related Workmentioning
confidence: 99%
“…This paper introduces the HiCMA library, the first implementation of taskbased tile low-rank Cholesky factorization on distributed-memory systems. Compared to the initial implementation on shared-memory environment [3] based on OpenMP, this paper uses instead the StarPU [9] dynamic runtime system to asynchronously schedule computational tasks across interconnected remote nodes. This highly productive association of task-based programming model with dynamic runtime systems permits to tackle in a systematic way advanced hardware systems by abstracting their complexity from the numerical library developers.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations