Proceedings of the 31st ACM SIGPLAN Conference on Programming Language Design and Implementation 2010
DOI: 10.1145/1806596.1806606
|View full text |Cite
|
Sign up to set email alerts
|

A GPGPU compiler for memory optimization and parallelism management

Abstract: This paper presents a novel optimizing compiler for general purpose computation on graphics processing units (GPGPU). It addresses two major challenges of developing high performance GPGPU programs: effective utilization of GPU memory hierarchy and judicious management of parallelism.The input to our compiler is a naïve GPU kernel function, which is functionally correct but without any consideration for performance optimization. The compiler analyzes the code, identifies its memory access patterns, and generat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
107
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 210 publications
(107 citation statements)
references
References 18 publications
0
107
0
Order By: Relevance
“…Recently we also see some work in automating such manual optimizations [30,31]. Such high-level optimizations from the software side are complementary to our Elastic Pipeline proposal, albeit with the limitation discussed in Sect.…”
Section: Related Workmentioning
confidence: 99%
“…Recently we also see some work in automating such manual optimizations [30,31]. Such high-level optimizations from the software side are complementary to our Elastic Pipeline proposal, albeit with the limitation discussed in Sect.…”
Section: Related Workmentioning
confidence: 99%
“…The memory accesses to these arrays will be monitored at runtime for misspeculation check. Although the compiler can identify the arrays that have irregular access patterns [18], not all of them will cause crossiteration dependence at runtime. Programmers can better identify which arrays need to be monitored.…”
Section: Irregular Memory Accessesmentioning
confidence: 99%
“…Baskaran et al [3] use a polyhedral compiler model to optimize affine memory accesses in regular loops. Yang et al [18] presented an optimizing compiler for memory bandwidth enhancement, data reuse, parallelism management, etc.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent research on developing programming and compiler techniques for GPUs focuses on (among others) general programming principles [5,9,14], cost modeling and analysis [1,15,3], automatic code generation [2,11], and performance tuning and optimization [4,6,12,19]. However, these research efforts are almost exclusively limited to DOALL loops.…”
Section: Introductionmentioning
confidence: 99%