Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation 2022
DOI: 10.1145/3519939.3523439
|View full text |Cite
|
Sign up to set email alerts
|

Bind the gap: compiling real software to hardware FFT accelerators

Abstract: Specialized hardware accelerators continue to be a source of performance improvement. However, such specialization comes at a programming price. The fundamental issue is that of a mismatch between the diversity of user code and the functionality of fixed hardware, limiting its wider uptake.Here we focus on a particular set of accelerators: those for Fast Fourier Transforms. We present FACC (Fourier ACcelerator Compiler), a novel approach to automatically map legacy code to Fourier Transform accelerators. It au… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 109 publications
0
3
0
Order By: Relevance
“…Because we cannot express general computation in TACO, there is a need to identify the code regions that can be lifted and accelerated. We use prior work in neural program classification [69] to determine which parts of the program represent tensor operations.…”
Section: Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Because we cannot express general computation in TACO, there is a need to identify the code regions that can be lifted and accelerated. We use prior work in neural program classification [69] to determine which parts of the program represent tensor operations.…”
Section: Classificationmentioning
confidence: 99%
“…We extend the method set out in FACC [69], where inputs are randomly generated according to manually given constraints dictating the length of arrays and favoring smaller values to make evaluation faster. We constrain arrays to be of size 4096, and fix tensor-dimensions to be equal (e.g., a 2-dimensional tensor is of size 64 × 64).…”
Section: Specificationmentioning
confidence: 99%
“…KernelFarer [23] works at the program level and restricts its attention to just GEMM API targets, but is more robust than IDL matching significantly more user code. This robustness is extended further in [55], [40] which uses behavioral equivalence to match code. Such approaches, however, are intrinsically limited as they focus on fixed APIs rather than the open-ended nature of DSLs and their IRs.…”
Section: Related Workmentioning
confidence: 99%