Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques 2018
DOI: 10.1145/3243176.3243192
|View full text |Cite
|
Sign up to set email alerts
|

Stencil codes on a vector length agnostic architecture

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 20 publications
(15 citation statements)
references
References 19 publications
0
15
0
Order By: Relevance
“…Furthermore, the constant for the multiplication (line 12) and the stencil instructions (line 14) are sent to the SPU. The streams for all the SPU are configured inside the loop (lines [22][23][24][25][26][27][28][29]. In this example, four streams are configured: three input streams to load the elements at A…”
Section: An Example: Jacobi-2d Stencilmentioning
confidence: 99%
“…Furthermore, the constant for the multiplication (line 12) and the stencil instructions (line 14) are sent to the SPU. The streams for all the SPU are configured inside the loop (lines [22][23][24][25][26][27][28][29]. In this example, four streams are configured: three input streams to load the elements at A…”
Section: An Example: Jacobi-2d Stencilmentioning
confidence: 99%
“…A wide body of research has focused on studying and analyzing stencil computations [3,16,18,22,26,32,38]. PIMS [22] is the most closely-related work to Casper.…”
Section: Related Workmentioning
confidence: 99%
“…We also evaluate the impact of compiler auto-vectorization using the recently proposed Scalable Vector Extension (SVE) ISA [53]. SVE is vector length agnostic, meaning that a single binary can run on any target vector length [4]. Therefore, we evaluate a scalar binary and an SVE-enabled binary with vector lengths of 128, 256, and 512 bits.…”
Section: Performance Evaluationmentioning
confidence: 99%