1996
DOI: 10.1109/71.553274

Data forwarding in scalable shared-memory multiprocessors

Cited by 46 publications (26 citation statements)
References 19 publications
“…Special deliver instruction initiates the sending of a cache block to processors specified by a bit vector. Koufaty et al proposed a framework for a compiler algorithm to insert data forwarding instructions in the code that exploits loop-level parallelism with do-all constructs [5]. Simulation analysis based on CC-UMA architecture has shown performance improvement of 50% for a system with large caches and 30% for a system with small caches.…”
Section: Related Work (mentioning)
confidence: 99%
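
The deliver instruction described in the statement above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the mechanism from the cited paper: the `Cache` class, the `deliver` function, and the convention that bit i of the mask selects processor i are all hypothetical names and choices made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Cache:
    """Toy per-processor cache: maps a block address to its data (assumed model)."""
    lines: dict = field(default_factory=dict)


def deliver(caches, producer, block, dest_mask):
    """Hypothetical 'deliver': copy `block` from the producer's cache into every
    cache whose bit is set in `dest_mask` (assumption: bit i = processor i)."""
    data = caches[producer].lines[block]
    receivers = []
    for pid in range(len(caches)):
        # Skip the producer itself; forward only to processors named in the mask.
        if pid != producer and (dest_mask >> pid) & 1:
            caches[pid].lines[block] = data
            receivers.append(pid)
    return receivers


# Four processors; processor 0 produces block 0x40 and forwards it to 1 and 3.
caches = [Cache() for _ in range(4)]
caches[0].lines[0x40] = "payload"
receivers = deliver(caches, producer=0, block=0x40, dest_mask=0b1010)
```

In this model the consumers find the block in their own caches on first use instead of missing on it, which is the effect the forwarding schemes in these statements aim for. A real implementation would operate in the coherence protocol and hardware, not in software dictionaries.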
“…This approach requires less sophisticated compiler support since it does not require identification of future consumers and it can be implemented at low cost. However, this approach is less flexible than classic data forwarding as defined in [5], because it does not allow forwarding to the processors not having the invalid copies of the data block.…”
Section: Introduction (mentioning)
confidence: 99%
“…While difficult to achieve in practice, it assumes that the last write self-invalidate [12,14] and ship the block to the reader through data forwarding [1,2,10,12] that performs a timely delivery of data to the consumers. Under these assumptions, the coherence miss reduction using the CPC for a history depth of four is shown in Figure 10.…”
Section: Performance Potential (mentioning)
confidence: 99%