Accuracy of performance counter measurements

Zaparanuks, Dmitrijs; Jovic, Milan; Hauswirth, Matthias

doi:10.1109/ispass.2009.4919635

Cited by 62 publications

(42 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In contrast, counts derived from simulations conducted using ValGrind [9] yielded consistent and reasonable values. Our experience is consistent with the observations of others who indicates performance counters can become inaccurate with more complex sequences of code such as those found within CoMD [15,16].…”

Section: Methodssupporting

confidence: 92%

Preliminary Investigation of Mobile System Features Potentially Relevant to HPC

Pruitt

Freudenthal

2016

2016 4th International Workshop on Energy Efficient Supercomputing (E2SC)

View full text Add to dashboard Cite

Energy consumption's increasing importance in scientific computing has driven an interest in developing energy efficient high performance systems. Energy constraints of mobile computing has motivated the design and evolution of low-power computing systems capable of supporting a variety of compute-intensive user interfaces and applications. Others have observed the evolution of mobile devices to also provide high performance [14]. Their work has primarily examined the performance and efficiency of compute-intensive scientific programs executed either on mobile systems or hybrids of mobile CPUs grafted into non-mobile (sometimes HPC) systems [6,12,14].This report describes an investigation of performance and energy consumption of a single scientific code on five high performance and mobile systems with the objective of identifying the performance and energy efficiency implications of a variety of architectural features. The results of this pilot study suggest that ISA is less significant than other specific aspects of system architecture in achieving high performance at high efficiency. The strategy employed in this study may be extended to other scientific applications with a variety of memory access, computation, and communication properties.

show abstract

Section: Methodssupporting

confidence: 92%

Preliminary Investigation of Mobile System Features Potentially Relevant to HPC

Pruitt

Freudenthal

2016

2016 4th International Workshop on Energy Efficient Supercomputing (E2SC)

View full text Add to dashboard Cite

show abstract

“…To prevent the processor clock frequency from changing during mea surements, frequency scaling should be disabled [Zaparanuks et al, 2009]. Hopper [2013] points out that theoretical hardware performance values do not always reflect actual application performance due to many factors, including caching effects, data locality, and instruction sequences, among other things.…”

Section: Cpu Utilization and Memory Latencymentioning

confidence: 99%

“…Zaparanuks et al [2009] conducted a well-designed experiment to determine the monitoring overhead of a number of configurations that allow user-level access to per-thread hardware counters in Linux. They observed that the monitoring overhead differs drastically between configurations.…”

Section: Monitoring Overheadmentioning

confidence: 99%

See 1 more Smart Citation

The ghost in the machine : don’t let it haunt your software performance measurements

Pieterse¹,

Flater²

2014

View full text Add to dashboard Cite

This paper describes pitfalls, issues, and methodology for measuring software performance. Ideally, measurement should be performed and reported in such a way that others will be able to reproduce the results in order to confirm their validity. We aim to motivate scientists to apply the necessary rigor to the design and execution of their software performance measurements to achieve reliable results. Repeatability of experiments, comparability of reported results, and verifiability of claims that are based on such results can be achieved only when measurements and reporting procedures can be trusted. In short, this paper urges the reader to measure the right performance and to measure the performance right.

show abstract

“…The most difficult problem these systems have to solve is that of shared memory interactions, something we can completely ignore because our source code is purely functional. In addition, some of these systems also try to replay the scheduling of threads (a requirement in our case), but they do so by using hardware counters [5,8], which makes them hardware dependent and subject to inaccurate measurements [12,17]. Another benefit of our approach is that we can modify the original program at will, as long as it produces the same allocations, in order to gather more information, and the replay will still be valid.…”

Section: Related Workmentioning

confidence: 99%

Repeating History: Execution Replay for Parallel Haskell Programs

Ferreiro

Janjić

Castro

et al. 2013

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. Parallel profiling tools, such as ThreadScope for Parallel Haskell, allow programmers to obtain information about the performance of their parallel programs. However, the information they provide is not always sufficiently detailed to precisely pinpoint the cause of some performance problems. Often, this is because the cost of obtaining that information would be prohibitive for a complete program execution. In this paper, we adapt the well-known technique of execution replay to make it possible to simulate a previous run of a program. We ensure that the non-deterministic parallel behaviour of the application is properly emulated while the deterministic functional code is run unmodified. In this way, we can gather additional data about the behaviour of a parallel program by replaying some parts of it with more detailed profiling information. We exploit this ability to identify performance bottlenecks in a quicksort implementation, and to derive a version that gives better speedups on multicore machines.

show abstract

Accuracy of performance counter measurements

Cited by 62 publications

References 14 publications

Preliminary Investigation of Mobile System Features Potentially Relevant to HPC

Preliminary Investigation of Mobile System Features Potentially Relevant to HPC

The ghost in the machine : don’t let it haunt your software performance measurements

Repeating History: Execution Replay for Parallel Haskell Programs

Contact Info

Product

Resources

About