2014 43rd International Conference on Parallel Processing 2014
DOI: 10.1109/icpp.2014.19
|View full text |Cite
|
Sign up to set email alerts
|

A Case for Resource Efficient Prefetching in Multicores

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…This version shows the limits/potential of the compiler to make use of the available architectural registers. InO-SW-Prefetch is a software prefetch solution taken from the work of Khan et al [38,40] 1 . InO-SW-prefetch uses profiling to identify delinquent loads to prefetch and manually inserts prefetch instructions at the appropriate distance.…”
Section: Swoop Unrolling and Sw Prefetchingmentioning
confidence: 99%
See 1 more Smart Citation
“…This version shows the limits/potential of the compiler to make use of the available architectural registers. InO-SW-Prefetch is a software prefetch solution taken from the work of Khan et al [38,40] 1 . InO-SW-prefetch uses profiling to identify delinquent loads to prefetch and manually inserts prefetch instructions at the appropriate distance.…”
Section: Swoop Unrolling and Sw Prefetchingmentioning
confidence: 99%
“…Comparison of unrolling and software prefetching with SWOOP, with and without hardware prefetching enabled. We evaluate the benchmarks common in the work proposing a modern software prefetcher [38,40] and SWOOP.…”
Section: Related Workmentioning
confidence: 99%
“…This version shows the limits/potential of the compiler to make use of the available architectural registers. InO-SW-Prefetch is a software prefetch solution taken from the work of Khan et al [38,40] 1 . InO-SW-prefetch uses proiling to identify delinquent loads to prefetch and manually inserts prefetch instructions at the appropriate distance.…”
Section: Swoop Unrolling and Sw Prefetchingmentioning
confidence: 99%
“…In order to increase the number of prefetching operations issue in time (and increase the speedup of the system), the prefetchers usually generates several prefetch requests each time it is triggered (to have a wider range of success), and this decreases the accuracy of the technique. Note that hardware prefetchers in commodity processors can be very aggressive, sometimes prefetching more than twice the data required for an application [38], [37]. Figure 1.3 shows some performance numbers obtained in [21] that also supports this argument.…”
Section: List Of Figuresmentioning
confidence: 66%