Proceedings 2000 International Conference on Parallel Processing
DOI: 10.1109/icpp.2000.876165
|View full text |Cite
|
Sign up to set email alerts
|

The NUMAchine multiprocessor

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 37 publications
(24 citation statements)
references
References 45 publications
0
24
0
Order By: Relevance
“…The use of this topology was inspired by a NUMAchine multiprocessor designed at University in Toronto. As shown in [18], good speedups were observed for virtually all multiprocessor benchmarks, in spite of its apparent bisectional bandwidth limitations. The architecture was shown amenable to efficient implementations, and the cache coherence protocol incorporated in NUMAchine exploited well the given topology resulting in a feasible and correct implementation.…”
Section: Related Workmentioning
confidence: 85%
See 2 more Smart Citations
“…The use of this topology was inspired by a NUMAchine multiprocessor designed at University in Toronto. As shown in [18], good speedups were observed for virtually all multiprocessor benchmarks, in spite of its apparent bisectional bandwidth limitations. The architecture was shown amenable to efficient implementations, and the cache coherence protocol incorporated in NUMAchine exploited well the given topology resulting in a feasible and correct implementation.…”
Section: Related Workmentioning
confidence: 85%
“…Because the buses are used at the lowest level of the hierarchy in [18], the total number of nodes will be kept modest if the number of hierarchy levels is kept low. By replacing local buses with meshes, we can accommodate more processors for the same hierarchical ring.…”
Section: B Suitability Of Hierarchical Ringsmentioning
confidence: 99%
See 1 more Smart Citation
“…In future, this work can be expanded for incorporation with multiprocessor design [41][42] [43], for enhancing debug features [44], in reliable networks on chip [45][46], as well as reversible [47], embedded high-density memory [48] design, and the extension is possible with transform-based techniques under lack of available data [49]. Finally, the low-power [50] and the sequential design test [51] can be applied.…”
Section: Future Workmentioning
confidence: 99%
“…As commercial CMPs continue to increase core counts, a new network design will be needed that balances the simplicity and low overhead of rings with the scalability of more complex topologies. [43,51,21,44,19] allows "local rings" with simple node routers to scale by connecting to a "global ring" via bridge routers.…”
Section: Introductionmentioning
confidence: 99%