Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming 2011
DOI: 10.1145/1941553.1941567
|View full text |Cite
|
Sign up to set email alerts
|

All-window profiling and composable models of cache sharing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
26
0

Year Published

2012
2012
2015
2015

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 40 publications
(26 citation statements)
references
References 28 publications
0
26
0
Order By: Relevance
“…A faster, but less detailed, approach is to only simulate/model parts of the system, and in particular the memory system. Such methods are either trace driven [2,4,3,27] or use high-level data [29,16] similar to the data we use. Finally, the least detailed approach simply aims to identify which applications are sensitive to resource contention [28,17,11].…”
Section: Related Workmentioning
confidence: 99%
“…A faster, but less detailed, approach is to only simulate/model parts of the system, and in particular the memory system. Such methods are either trace driven [2,4,3,27] or use high-level data [29,16] similar to the data we use. Finally, the least detailed approach simply aims to identify which applications are sensitive to resource contention [28,17,11].…”
Section: Related Workmentioning
confidence: 99%
“…There are several models [2,4,3,17] using stack distance traces. Chandra et al [2] pioneered the field with a statistical model that estimates the probability that an access becomes a miss by prolonging its stack distance with the expected number of accesses performed by other applications.…”
Section: Related Workmentioning
confidence: 99%
“…This input data consist of the applications' fetch and hit rates, IPCs, and hit ratios as a function of their cache allocation, and can be acquired with low overhead on modern multicore machines [6]. This low-overhead data is in contrast to many existing methods for modeling cache sharing which rely on expensive data such as stack distance traces [2,4,3,17].…”
Section: Introductionmentioning
confidence: 99%
“…snapshots. Three recent papers have solved the problem of measuring the footprint in all execution windows and given a linear-time solution to compute the average footprint [5], [6], [16].…”
Section: Introductionmentioning
confidence: 99%
“…the additional misses due to sharing, can be computed from single-program statistics. This is known as the composable model because it uses a linear number of sequential tests to predict the performance of an exponential number of parallel co-runs [5]. In shared cache, the reuse distance in thread A is lengthened by the footprint of thread B.…”
Section: Introductionmentioning
confidence: 99%