2013
DOI: 10.1007/978-3-642-38718-0_34
|View full text |Cite
|
Sign up to set email alerts
|

Designing Linear Algebra Algorithms by Transformation: Mechanizing the Expert Developer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
43
0

Year Published

2013
2013
2015
2015

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 21 publications
(43 citation statements)
references
References 12 publications
0
43
0
Order By: Relevance
“…The FLAME interfaces for indexing are also included to omit indexing in favor of reasoning about matrix partitions. The benefit of these interfaces is that parallelizing most sequential DLA algorithms in high-performance Elemental code is rote (this is described and automated in [6,7]). An expert needs to decide which distributions are efficient and how to redistribute between them.…”
Section: Experiments With Interfacesmentioning
confidence: 99%
See 3 more Smart Citations
“…The FLAME interfaces for indexing are also included to omit indexing in favor of reasoning about matrix partitions. The benefit of these interfaces is that parallelizing most sequential DLA algorithms in high-performance Elemental code is rote (this is described and automated in [6,7]). An expert needs to decide which distributions are efficient and how to redistribute between them.…”
Section: Experiments With Interfacesmentioning
confidence: 99%
“…With Elemental experts predict runtime to choose which parallelization schemes to use or optimizations to apply. Estimates are first-order approximations in terms of the amount of computation performed and the amount of data communicated between processes [6,7]. Thanks to the interfaces in Elemental, BLIS, and libflame, relatively rough cost estimates are good enough to guide experts without having to implement, compile, run, and time code.…”
Section: Encoding Expert Knowledge For Automatic Code Generationmentioning
confidence: 99%
See 2 more Smart Citations
“…Another different work seeks to update BLAS by extending it with additional functionalities [26]. Build-to-order BLAS [27] and Design-by-transformation BLAS [28] approach the problem from a different angle. Their goal is to generate optimized and tuned BLAS-like functions from high level kernel specifications.…”
Section: Related Workmentioning
confidence: 99%