1997 IEEE International Performance, Computing and Communications Conference
DOI: 10.1109/pccc.1997.581530
|View full text |Cite
|
Sign up to set email alerts
|

Parsim: a parallel trace-driven simulation facility for fast and accurate performance analysis studies

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 7 publications
0
6
0
Order By: Relevance
“…(Application of the fixed, 7000-instruction warmup to cache state was demonstrated to be untrustworthy in Haskins, Jr. and Skadron [2001].) Nguyen et al [1997], on the other hand, approach the problem of warmup analytically as a part of the trace-driven PARSIM parallel microprocessor simulation system. Their formula calculates a function of the cache block width, associativity, the average population density of memory references within the instruction stream, and the average steady-state cache miss ratio.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…(Application of the fixed, 7000-instruction warmup to cache state was demonstrated to be untrustworthy in Haskins, Jr. and Skadron [2001].) Nguyen et al [1997], on the other hand, approach the problem of warmup analytically as a part of the trace-driven PARSIM parallel microprocessor simulation system. Their formula calculates a function of the cache block width, associativity, the average population density of memory references within the instruction stream, and the average steady-state cache miss ratio.…”
Section: Related Workmentioning
confidence: 99%
“…To accelerate sampled simulation even further, one can avoid FULL-WARMUP by only modeling those interactions that occur within a certain number of instructions prior to each sample cluster [Conte et al 1996;Crowley and Baer 1999;Eeckhout et al 2003;Skadron 2001, 2003;Kessler et al 1991;Nguyen et al 1997]. This approach exploits temporal locality [Hennessy and Patterson 1995]: the propensity of programs to demonstrate a strong correlation between recency of use and next use, for example, of cache blocks.…”
Section: Introductionmentioning
confidence: 99%
“…Then Nguyen et al [10] extended this idea to full trace analysis in which intervals overlap each other to form warmup phases. They also devised a heuristic method for their PARSIM to determine the length of a warmup phase using L1 cache hit ratio as the measure to estimate how each simulator node is warmed.…”
Section: Introductionmentioning
confidence: 99%
“…Nguyen et al [12] on the other hand, approach the problem of warmup analytically as a part of the trace-driven PARSIM parallel microprocessor simulation system. Their formula calculates a function of the cache block width, associativity, the average population density of memory references within the instruction stream, and the average steady-state cache miss ratio.…”
Section: Related Workmentioning
confidence: 99%
“…One method for further accelerating sampled simulations is to avoid fullwarmup by only modeling those interactions that occur within a given number of instructions prior to the cluster [3], [5], [8], [12]. Our technique makes the determination of when to engage cache and branch predictor warmup by exploiting memory reference reuse latencies (MRRL)-a measurement of the number of instructions that elapse between successive references to the same address.…”
Section: Introductionmentioning
confidence: 99%