2014
DOI: 10.1007/978-3-319-09873-9_42
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Computation of Echelon Forms

Abstract: We propose efficient parallel algorithms and implementations on shared memory architectures of LU factorization over a finite field. Compared to the corresponding numerical routines, we have identified three main difficulties specific to linear algebra over finite fields. First, the arithmetic complexity could be dominated by modular reductions. Therefore, it is mandatory to delay as much as possible these reductions while mixing fine-grain parallelizations of tiled iterative and recursive algorithms. Second, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
1

Year Published

2014
2014
2020
2020

Publication Types

Select...
3
1
1

Relationship

3
2

Authors

Journals

citations
Cited by 13 publications
(20 citation statements)
references
References 12 publications
0
19
1
Order By: Relevance
“…In section 3 we study different iterative and recursive variants and cutting strategies for the parallel matrix multiplication pfgemm and compare them with our best iterative standard parallel matrix multiplication [11]. In section 4 we show three parallel algorithms for the pftrsm routine: an iterative variant, a recursive variant and a hybrid combination.…”
Section: Methodology Of Experimentsmentioning
confidence: 99%
See 4 more Smart Citations
“…In section 3 we study different iterative and recursive variants and cutting strategies for the parallel matrix multiplication pfgemm and compare them with our best iterative standard parallel matrix multiplication [11]. In section 4 we show three parallel algorithms for the pftrsm routine: an iterative variant, a recursive variant and a hybrid combination.…”
Section: Methodology Of Experimentsmentioning
confidence: 99%
“…This is one argument in favor of a coarse granularity in our algorithms. Table 1 taken from [11] shows the impact of the block size for iterative and recur-…”
Section: Ingredients For the Design Of Parallel Kernelsmentioning
confidence: 99%
See 3 more Smart Citations