2020
DOI: 10.48550/arxiv.2001.09995
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Automated Parallel Kernel Extraction from Dynamic Application Traces

Richard Uhrie,
Chaitali Chakrabarti,
John Brunhaver

Abstract: Modern program runtime is dominated by segments of repeating code called kernels. Kernels are accelerated by increasing memory locality, increasing data-parallelism, and exploiting producer-consumer parallelism among kernels -which requires hardware specialized for a particular class of kernels. Programming this hardware can be difficult, requiring that the kernels be identified and annotated in the code or translated to a domain-specific language. This paper describes a technique to automatically localize par… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 17 publications
0
1
0
Order By: Relevance
“…To do this, we begin by converting the user's application into the LLVM [28] intermediate representation (IR). The user's LLVM IR is then passed through TraceAtlas [49], an open source 2 toolchain for collecting and analyzing dynamic application traces from arbitrary application code. Using TraceAtlas, we modify the user's LLVM IR to include tracing instrumentation, compile a tracing executable, and execute it.…”
Section: Automated Compilation Processmentioning
confidence: 99%
“…To do this, we begin by converting the user's application into the LLVM [28] intermediate representation (IR). The user's LLVM IR is then passed through TraceAtlas [49], an open source 2 toolchain for collecting and analyzing dynamic application traces from arbitrary application code. Using TraceAtlas, we modify the user's LLVM IR to include tracing instrumentation, compile a tracing executable, and execute it.…”
Section: Automated Compilation Processmentioning
confidence: 99%