1992
DOI: 10.1109/2.121510
|View full text |Cite
|
Sign up to set email alerts
|

The Stanford Dash multiprocessor

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
327
0
10

Year Published

1997
1997
2021
2021

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 767 publications
(337 citation statements)
references
References 13 publications
0
327
0
10
Order By: Relevance
“…In this section, we take a closer look at two speci c DSM implementations, the hardware cache-coherent DSM examples include 46,44,23,38,47 , and the software page-based DSM examples include 48,6,13,36,37 . We will focus on how the implementation of distributed shared memory and cache coherence di er on these two architectures.…”
Section: Dsm Implementationmentioning
confidence: 99%
“…In this section, we take a closer look at two speci c DSM implementations, the hardware cache-coherent DSM examples include 46,44,23,38,47 , and the software page-based DSM examples include 48,6,13,36,37 . We will focus on how the implementation of distributed shared memory and cache coherence di er on these two architectures.…”
Section: Dsm Implementationmentioning
confidence: 99%
“…Most current multicore architectures do not have this problem since they are not using multithreading (T p = 1) for latency hiding but coherent caches to exploit access locality in programs (where available) or just try to tolerate natural latency defined by the distance of memory access making memory access nonuniform [17,22].…”
Section: Adding Numa Supportmentioning
confidence: 99%
“…They propose a set of optimizations that can reach the performance of heavyweight hardware support. These optimizations include write-forwarding [1,19,20,26] at line boundaries, synchronization counters in L2 caches (which they do not describe), and small dedicated receive-side caches for pipelined streaming data in a separate address space. Our design integrates equivalent mechanisms inside general purpose caches, augmented with RDMA for efficient bulk transfers.…”
Section: Related Work and Contributionsmentioning
confidence: 99%