1996
DOI: 10.1109/71.553272
|View full text |Cite
|
Sign up to set email alerts
|

Characterizing the memory behavior of compiler-parallelized applications

Abstract: Compiler-parallelized applications are increasing in importance as moderate-scale multiprocessors become common. This paper evaluates how features of advanced memory systems (e.g., longer cache lines) impact memory system behavior for applications amenable to compiler parallelization. Using full-sized input data sets and applications taken from standard benchmark suites, we measure statistics such as speedups, synchronization and load imbalance, causes of cache misses, cache line utilization, data tra c and me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

1998
1998
2005
2005

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 21 publications
0
10
0
Order By: Relevance
“…To improve memory hierarchy performance, various manual and automated optimization techniques have been developed for uniprocessors and parallel systems [9]- [11]. To exploit the spatial and temporal locality [12] properties, the data access order is changed by reordering the computation.…”
Section: A Memory Hierarchy Performance Optimizationsmentioning
confidence: 99%
“…To improve memory hierarchy performance, various manual and automated optimization techniques have been developed for uniprocessors and parallel systems [9]- [11]. To exploit the spatial and temporal locality [12] properties, the data access order is changed by reordering the computation.…”
Section: A Memory Hierarchy Performance Optimizationsmentioning
confidence: 99%
“…Many systematic software-oriented memory management approaches exist in literature but they do not focus on the combination of performance and overall power, 'partly sponsored by the Esprit ESDLPD project 25518 : DAB-LP 0-7803-5650-0/99/$10.00 0 1999 IEEE which is vital as motivated above [23]. In terms of hardware-orientedmemory management, most approaches focus on scalar signals.…”
Section: Context and Motivationmentioning
confidence: 99%
“…Various manual and automated memory access optimization techniques have been developed for uniprocessors and parallel systems [8,10,15]. Access reordering involves modification of the computation to change the order in which data is accessed.…”
Section: Memory Access Optimizationsmentioning
confidence: 99%
“…15 points can fit in the cache. As the number of FFT points increases from 2 14 to 2 16 , the Miss rates for a single precision 256K-point FFT with various cache block sizes…”
mentioning
confidence: 99%