2018
DOI: 10.1109/jproc.2018.2868961
|View full text |Cite
|
Sign up to set email alerts
|

Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…Another promising direction is automatically auto-tuning the algorithmic parameters of a program based upon the downstream choice of hardware. This facilitates easier deployment by tailoring the program to achieve good performance and load balancing on a variety of hardware (Dongarra et al, 2018;Clint Whaley et al, 2001;Asanović et al, 2006;Ansel et al, 2014).…”
Section: A Software Revolutionmentioning
confidence: 99%
“…Another promising direction is automatically auto-tuning the algorithmic parameters of a program based upon the downstream choice of hardware. This facilitates easier deployment by tailoring the program to achieve good performance and load balancing on a variety of hardware (Dongarra et al, 2018;Clint Whaley et al, 2001;Asanović et al, 2006;Ansel et al, 2014).…”
Section: A Software Revolutionmentioning
confidence: 99%
“…Given the diverse, evolving, and possibly heterogeneous architectures on which software must run, automatic ways to select the various algorithmic parameters will be increasingly needed in order to achieve good performance, energy efficiency, load balancing, and so on. Autotuning is already routinely used for core numerical linear algebra algorithms; see, e.g., [100], [101], and the references therein.…”
Section: Towards Hpc's Next Scalementioning
confidence: 99%
“…Autotuning is already routinely used for core numerical linear algebra algorithms, see, e.g. [99,100], and references therein.…”
Section: (A) Asynchronous Algorithmsmentioning
confidence: 99%
“…In numerical linear algebra the term "batched computation" is well-established, signifying a simultaneous processing of a large quantity of relatively small problems, e.g., the LU and the Cholesky factorizations [15] and the corresponding linear system solving [16] on the GPUs, with appropriate data layouts. It is therefore both justifiable and convenient to reuse the term in the present context.…”
Section: Introductionmentioning
confidence: 99%