Proceedings of the 19th Annual International Conference on Supercomputing 2005
DOI: 10.1145/1088149.1088161
|View full text |Cite
|
Sign up to set email alerts
|

Low-overhead call path profiling of unmodified, optimized code

Abstract: Call path profiling associates resource consumption with the calling context in which resources were consumed. We describe the design and implementation of a low-overhead call path profiler based on stack sampling. The profiler uses a novel sample-driven strategy for collecting frequency counts for call graph edges without instrumenting every procedure's code to count them. The data structures and algorithms used are efficient enough to construct the complete calling context tree exposed during sampling. The p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
64
0
1

Year Published

2009
2009
2015
2015

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 77 publications
(69 citation statements)
references
References 18 publications
0
64
0
1
Order By: Relevance
“…Complex function call transitions by setjmp/longjmp are known to break the orderly sequence of calls and returns as discussed in [20]. Therefore, several practical concerns must be addressed to keep the consistency of a loop stack within the call stack.…”
Section: To Track the Precise Loop Stack In A Real Programmentioning
confidence: 99%
“…Complex function call transitions by setjmp/longjmp are known to break the orderly sequence of calls and returns as discussed in [20]. Therefore, several practical concerns must be addressed to keep the consistency of a loop stack within the call stack.…”
Section: To Track the Precise Loop Stack In A Real Programmentioning
confidence: 99%
“…HPCToolkit uses a lightweight trampoline, as shown in Figure 1(a) [9]. Instead of pointing return addresses to a stack, the topmost function of a prefix is instrumented to return into a trampoline function.…”
Section: A Fast Call-path Unwindingmentioning
confidence: 99%
“…This event causes control flow to skip the return that would install the trampoline in a lower frame. The trampoline approach thus must instrument all non-local exits to routines, which requires more complex code analysis [10]. Our approach avoids this analysis by simply instrumenting all return addresses.…”
Section: A Fast Call-path Unwindingmentioning
confidence: 99%
See 1 more Smart Citation
“…This context contains the current path of the task through the distributed system. Then, statistical sampling is used to indirectly measure resource usage, and requires no intrusive OS-level instrumentation [20].…”
Section: Performance Profilesmentioning
confidence: 99%