Proceedings of the 21st Annual International Conference on Supercomputing 2007
DOI: 10.1145/1274971.1275000
|View full text |Cite
|
Sign up to set email alerts
|

Performance driven data cache prefetching in a dynamic software optimization system

Abstract: Software or hardware data cache prefetching is an efficient way to hide cache miss latency. However effectiveness of the issued prefetches have to be monitored in order to maximize their positive impact while minimizing their negative impact on performance. In previous proposed dynamic frameworks, the monitoring scheme is either achieved using processor performance counters or using specific hardware. In this work, we propose a prefetching strategy which does not use any specific hardware component or processo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…We hence include them to calculate the eventual priority of each cache line. As to the type of a cache line, due to the inaccuracy of the prefetcher, prefetched cache lines have a high likelihood of ending up as the dead ones [21] [22]. We give each cache line 1 bit to state whether it is prefetched or not.…”
Section: B Design Of Cbpmentioning
confidence: 99%
“…We hence include them to calculate the eventual priority of each cache line. As to the type of a cache line, due to the inaccuracy of the prefetcher, prefetched cache lines have a high likelihood of ending up as the dead ones [21] [22]. We give each cache line 1 bit to state whether it is prefetched or not.…”
Section: B Design Of Cbpmentioning
confidence: 99%
“…As a binary optimizer, ADORE [7] uses hardware counters to identify hotspots and phases and to apply memory prefetching optimizations. Similar goals are addressed in [5] where a dynamic system inserts prefetch instructions on-the-fly where it has been evaluated as effective by measuring the load latency. Both approaches focus on reducing the memory latency of memory instructions.…”
Section: Related Workmentioning
confidence: 99%
“…The final state is the prefetching of these delinquent loads. Another prefetch solution is dynamic prefetching as proposed by Beyler et al [3]. They studied a dynamic prefetch mechanism using the load latency variation to classify the loads.…”
Section: Related Workmentioning
confidence: 99%
“…This solution is based on hardware monitor of the Itanium processor. The two previous work [3] and [7] were done on Itanium architecture which is used for high performance computing. Our work is done on a light embedded VLIW processor which generally executes a single task; so, the dynamic prefetch mechanism is an inappropriate solution for our kind of architecture.…”
Section: Related Workmentioning
confidence: 99%