2016
DOI: 10.15439/2016f436
|View full text |Cite
|
Sign up to set email alerts
|

Parallelizing nested loops on the Intel Xeon Phi on the example of the dense WZ factorization

Abstract: Abstract-In this article we evaluate some strategies of parallelizing nested loops on Intel Xeon Phi on the example of the WZ factorization for dense matrices. We employ both parallelism and vectorization to accelerate nested loops on manycore coprocessor.For random dense square matrices with the dominant diagonal we report the execution time and the performance of the nested loops. Numerical experiments show that the vectorization that is efficiently exploiting SIMD vector units do not always improve the appl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 7 publications
(7 reference statements)
0
5
0
Order By: Relevance
“…W Z factorization has been applied in modeling of Markov chains aside its parallelization usage [27]. W Z factorization offers parallelization in solving both sparse and dense linear system to enhance performance using OpenMP, GPU, CUDA or EDK HW/SW codesign architecture [28,29,12]. Then, Yalamov [24] presented that W Z factorization is faster on computer with a parallel architecture than any other matrix factorization methods.…”
Section: Importance Of W Z Factorizationmentioning
confidence: 99%
See 1 more Smart Citation
“…W Z factorization has been applied in modeling of Markov chains aside its parallelization usage [27]. W Z factorization offers parallelization in solving both sparse and dense linear system to enhance performance using OpenMP, GPU, CUDA or EDK HW/SW codesign architecture [28,29,12]. Then, Yalamov [24] presented that W Z factorization is faster on computer with a parallel architecture than any other matrix factorization methods.…”
Section: Importance Of W Z Factorizationmentioning
confidence: 99%
“…Demeure [44] first posited the term hourglass matrix in describing a matrix derived from factorizing a square matrix via quadrant interlocking factorization or bowtie-hourglass factorization. It was further elucidated that hourglass matrix is synonymous to Z-matrix which can be partitioned into blocks structured Z-system [15,28]. Unfortunately, there are changes in structure of Z-matrix from W Z factorization which depend on the type of matrix (Toeplitz, Hankel, Hermitian, centrosymmetric, diagonally dominant or tridiagonal matrix) being factorized.…”
Section: W H Factorizationmentioning
confidence: 99%
“…When parallelizing such cases, it is first necessary to determine at which level of the cycles we want to parallelize, the strategies of which are illustrated in Table 1. [4,7,5,19]…”
Section: Nested Loop Parallelizationmentioning
confidence: 99%
“…• wz -WZ factorization: dense, square, non-structured matrix factorization algorithm [20], • edge detect -2D-convolution routine to expose edge information from the UTDSP Benchmark suite 8 ,…”
Section: Experimental Studymentioning
confidence: 99%