2011 IEEE International Parallel &Amp; Distributed Processing Symposium 2011
DOI: 10.1109/ipdps.2011.68
|View full text |Cite
|
Sign up to set email alerts
|

A Practical Approach for Performance Analysis of Shared-Memory Programs

Abstract: Abstract-Parallel programming has transcended from HPC into mainstream, enabled by a growing number of programming models, languages and methodologies, as well as the availability of multicore systems. However, performance analysis of parallel programs is still difficult, especially for large and complex programs, or applications developed using different programming models. This paper proposes a simple analytical model for studying the speedup of shared-memory programs on multicore systems. The proposed model… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 17 publications
(18 citation statements)
references
References 31 publications
0
18
0
Order By: Relevance
“…Using extensive measurement analysis on state of the art multicore systems [8], [9], we conclude that for weak-scaling programs the last-level misses, number of cycles unrelated to memory contention and work cycles do not change significantly when n changes. Furthermore, we study the pattern of burstiness of the memory requests and conclude that large parallel programs do not exhibit bursty memory traffic [9].…”
Section: Model Of Memory Contentionmentioning
confidence: 97%
See 4 more Smart Citations
“…Using extensive measurement analysis on state of the art multicore systems [8], [9], we conclude that for weak-scaling programs the last-level misses, number of cycles unrelated to memory contention and work cycles do not change significantly when n changes. Furthermore, we study the pattern of burstiness of the memory requests and conclude that large parallel programs do not exhibit bursty memory traffic [9].…”
Section: Model Of Memory Contentionmentioning
confidence: 97%
“…We define ω(n), the memory contention factor, as the number of threads busy due to memory overhead to the number of threads busy due to useful work. With these notations, following the derivations described in [8], the speedup of a shared-memory program is:…”
Section: Parallelism and Energy Performance Models A Model Of Pmentioning
confidence: 99%
See 3 more Smart Citations