Proceedings Sixth International Symposium on High-Performance Computer Architecture. HPCA-6 (Cat. No.PR00550)
DOI: 10.1109/hpca.2000.824358
|View full text |Cite
|
Sign up to set email alerts
|

A DSM architecture for a parallel computer Cenju-4

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 11 publications
0
4
0
Order By: Relevance
“…In order to resolve the former problem, methods have been proposed that reduce the amount of memory required in the directories by storing incomplete sharing information in the directories [1,2]. Although these methods are able to reduce the amount of memory compared to management by the full map scheme, the problem of delays remains because the directories still need to be accessed via the MIN.…”
Section: Min That Include Cache Control Mechanismsmentioning
confidence: 97%
See 1 more Smart Citation
“…In order to resolve the former problem, methods have been proposed that reduce the amount of memory required in the directories by storing incomplete sharing information in the directories [1,2]. Although these methods are able to reduce the amount of memory compared to management by the full map scheme, the problem of delays remains because the directories still need to be accessed via the MIN.…”
Section: Min That Include Cache Control Mechanismsmentioning
confidence: 97%
“…A variety of methods have been proposed to address these problems, including methods that reduce the hardware cost of the directories by storing incomplete sharing information [1,2], methods that implement part of the cache within the switches in the MINs [4,5], and methods that implement the directory within the switches in the MINs [6][7][8]. However, none of these methods have been able to sufficiently resolve the increases in memory capacities and access times and there remains room for improvement.…”
Section: Introductionmentioning
confidence: 98%
“…The number of forward and backward pointers is proportional to the number of cache lines in the machine, which is much smaller than the number of memory lines. Although some optimizations to the initial proposal have been studied (for example, in [12] and [13]) and several commercial machines have been implemented using this kind of protocol, such as the Sequent NUMA-Q [14] and Convex Exemplar [15] multiprocessors, the important drawbacks these protocols entail have decreased their popularity, and from the SGI Origin 2000 [6] onward, most designs use memory-based directory protocols, such as Piranha [16], the AlphaServer GS320 [17], or the Cenju-4 [18]. Among others, these drawbacks include the increased latency of coherence transactions as well as occupancy of cache controllers, complex protocol implementations [5] and, what is more important, the need of larger cache states and extra bits for forward and backward pointers, which implies changing processor caches.…”
Section: Cache-based Directory Protocolsmentioning
confidence: 99%
“…We tried to improve their performance on an NEC Cenju-3 [10] and an NEC Cenju-4 [9] using trace files generated on a cluster of PCs. The reason why we use our cluster for trace file generation is simply due to the limitation on usage of the storage available on the Cenju-3 and Cenju-4.…”
Section: Case Studiesmentioning
confidence: 99%