2009 IEEE International Symposium on Performance Analysis of Systems and Software 2009
DOI: 10.1109/ispass.2009.4919635
|View full text |Cite
|
Sign up to set email alerts
|

Accuracy of performance counter measurements

Abstract: Many workload characterization studies depend on accurate measurements of the cost of executing a piece of code. Often these measurements are conducted using infrastructures to access hardware performance counters. Most modern processors provide such counters to count micro-architectural events such as retired instructions or clock cycles. These counters can be difficult to configure, may not be programmable or readable from user-level code, and can not discriminate between events caused by different software … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
40
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 62 publications
(42 citation statements)
references
References 14 publications
2
40
0
Order By: Relevance
“…In contrast, counts derived from simulations conducted using ValGrind [9] yielded consistent and reasonable values. Our experience is consistent with the observations of others who indicates performance counters can become inaccurate with more complex sequences of code such as those found within CoMD [15,16].…”
Section: Methodssupporting
confidence: 92%
“…In contrast, counts derived from simulations conducted using ValGrind [9] yielded consistent and reasonable values. Our experience is consistent with the observations of others who indicates performance counters can become inaccurate with more complex sequences of code such as those found within CoMD [15,16].…”
Section: Methodssupporting
confidence: 92%
“…To prevent the processor clock frequency from changing during mea surements, frequency scaling should be disabled [Zaparanuks et al, 2009]. Hopper [2013] points out that theoretical hardware performance values do not always reflect actual application performance due to many factors, including caching effects, data locality, and instruction sequences, among other things.…”
Section: Cpu Utilization and Memory Latencymentioning
confidence: 99%
“…Zaparanuks et al [2009] conducted a well-designed experiment to determine the monitoring overhead of a number of configurations that allow user-level access to per-thread hardware counters in Linux. They observed that the monitoring overhead differs drastically between configurations.…”
Section: Monitoring Overheadmentioning
confidence: 99%
See 1 more Smart Citation
“…The most difficult problem these systems have to solve is that of shared memory interactions, something we can completely ignore because our source code is purely functional. In addition, some of these systems also try to replay the scheduling of threads (a requirement in our case), but they do so by using hardware counters [5,8], which makes them hardware dependent and subject to inaccurate measurements [12,17]. Another benefit of our approach is that we can modify the original program at will, as long as it produces the same allocations, in order to gather more information, and the replay will still be valid.…”
Section: Related Workmentioning
confidence: 99%