2019
DOI: 10.1145/3264491
|View full text |Cite
|
Sign up to set email alerts
|

Plasma

Abstract: The recent version of the Parallel Linear Algebra Software for Multicore Architectures (PLASMA) library is based on tasks with dependencies from the OpenMP standard. The main functionality of the library is presented. Extensive benchmarks are targeted on three recent multicore and manycore architectures, namely, an Intel Xeon, Intel Xeon Phi, and IBM POWER 8 processors.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(6 citation statements)
references
References 36 publications
0
6
0
Order By: Relevance
“…We now experiment with the state-of-the-art parallel numerical linear algebra library PLASMA [6]. PLASMA includes much of the functionality of the BLAS and LAPACK, but partitions matrices into t \times t blocks called tiles.…”
Section: Matrix Multiplicationmentioning
confidence: 99%
See 1 more Smart Citation
“…We now experiment with the state-of-the-art parallel numerical linear algebra library PLASMA [6]. PLASMA includes much of the functionality of the BLAS and LAPACK, but partitions matrices into t \times t blocks called tiles.…”
Section: Matrix Multiplicationmentioning
confidence: 99%
“…In section 5.1 we experiment with summation using MATLAB R2018b. In section 5.2 we present performance and accuracy results with PLASMA [6], a state-ofthe-art numerical linear algebra library that we have modified by integrating FABsum.…”
mentioning
confidence: 99%
“…In our calculations a custom implementation of this algorithm inspired by the PLASMA library [39] was utilized with custom vectorization of quaddouble arithmetics based on Bailey's quad-double precision algorithms [40]. Incorporation of both the communication-avoiding variant of Aasen's factorization and vectorized quad-double arithmetics constitutes a very efficient implementation of dense matrix factorization -an essential component of the inverse iteration method with nonorthogonal bases.…”
Section: Methodsmentioning
confidence: 99%
“…Paper [17] presents PLASMA-the Parallel Linear Algebra Software for Multicore Architectures-a version which is an OpenMP task based implementation adopting a tilebased approach to storage, along with algorithms that operate on tiles and use OpenMP for dynamic scheduling based on tasks with dependencies and priorities. Detailed assessment of the software performance is presented in the paper using three platforms with 2 × Intel Xeon CPU E5-2650 v3 CPUs at 2.3 GHz, Intel Xeon Phi 7250 and 2 × IBM POWER8 CPUs at 3.5 GHz, respectively, using gcc compared to MKL (for Intel) and ESSL (for IBM).…”
Section: Openmp Related Framework and Layers For Parallelizationmentioning
confidence: 99%