2016 Sixth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing (WOLFHP 2016
DOI: 10.1109/wolfhpc.2016.08
|View full text |Cite
|
Sign up to set email alerts
|

YASK—Yet Another Stencil Kernel: A Framework for HPC Stencil Code-Generation and Tuning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
32
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(32 citation statements)
references
References 14 publications
0
32
0
Order By: Relevance
“…On Xeon and Xeon Phi processors, one of the most highlyoptimized implementations was introduced by Yount [13], which uses a technique called "Vector Folding" that is suitable for wide-vector architectures. This implementation was further optimized and made available to public as the "YASK" framework [9]. We use this framework in our evaluation.…”
Section: Introductionmentioning
confidence: 99%
“…On Xeon and Xeon Phi processors, one of the most highlyoptimized implementations was introduced by Yount [13], which uses a technique called "Vector Folding" that is suitable for wide-vector architectures. This implementation was further optimized and made available to public as the "YASK" framework [9]. We use this framework in our evaluation.…”
Section: Introductionmentioning
confidence: 99%
“…Each backend transformation pass is based on manipulating an input AST and returning a new, different AST. One of the reasons behind this software engineering strategy, which is clearly more challenging than a template-based solution, is to ease the integration of external tools, such as the YASK stencil optimizer [Yount16]. We are currently in the process of integrating YASK to complement the DLE, so that YASK may replace some (but not all) DLE passes.…”
Section: Integration With Yaskmentioning
confidence: 99%
“…Creating the proper Python bindings in YASK so that Devito can drive the code generation process. It has been shown that real-world stencil codes optimised through YASK may achieve an exceptionally high fraction of the attainable machine peak [Yount15], [Yount16]. Further, initial prototyping (manual optimization of Devito-generated code through YASK) revealed that YASK may also outperform the loop optimization engine currently available in Devito, besides ensuring seamless performance portability across a range of computer architectures.…”
Section: Integration With Yaskmentioning
confidence: 99%
“…a combination of the number of potential loops blocking sizes for 3D scenario with any additional context, would create an intractable set of combinations that would take years to explore. Thus, we used the YASK framework [7] and its genetic algorithm (GA) auto-tuning system to find nearoptimum settings. For each desired result, the GA runs for at least three separate experiments to avoid finding a local minimum prematurely.…”
Section: Optimization Strategiesmentioning
confidence: 99%