Proceedings of the 42nd Annual International Symposium on Computer Architecture 2015
DOI: 10.1145/2749469.2750375
|View full text |Cite
|
Sign up to set email alerts
|

Flexible software profiling of GPU architectures

Abstract: To aid application characterization and architecture design space exploration, researchers and engineers have developed a wide range of tools for CPUs, including simulators, profilers, and binary instrumentation tools. With the advent of GPU computing, GPU manufacturers have developed similar tools leveraging hardware profiling and debugging hooks. To date, these tools are largely limited by the fixed menu of options provided by the tool developer and do not offer the user the flexibility to observe or act on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 64 publications
(9 citation statements)
references
References 29 publications
0
9
0
Order By: Relevance
“…SPE subsumes software engineering activities that are applied to meet performance requirements and achieve improvements. Profiling tools can be used to measure performance (e.g., for energy consumption [252], [253], GPUs [254], responsiveness [255], and memory [256]).…”
Section: Optimization In Software Engineeringmentioning
confidence: 99%
“…SPE subsumes software engineering activities that are applied to meet performance requirements and achieve improvements. Profiling tools can be used to measure performance (e.g., for energy consumption [252], [253], GPUs [254], responsiveness [255], and memory [256]).…”
Section: Optimization In Software Engineeringmentioning
confidence: 99%
“…Moreover, SASSI is able to read the values residing in memory location and registers. As demonstrated in the paper [47], SASSI supports to build effective fine-grained profilers. However, SASSI has several limitations in its portability, expansibility, complexity, and coverage.…”
Section: Existing Gpu Profilers and Limitationsmentioning
confidence: 99%
“…As shown in Figure 10, the runtime overhead mostly ranges from 10× to 120×. It is much faster than simulators such as GPGPU-Sim that usually incurs 1-10 millions of slow down to the native execution [47].…”
Section: Tool's Overhead Analysismentioning
confidence: 99%
See 2 more Smart Citations