1994
DOI: 10.1145/195470.195569
|View full text |Cite
|
Sign up to set email alerts
|

The performance impact of flexibility in the Stanford FLASH multiprocessor

Abstract: A flexible communication mechanism is a desirable feature in multiprocessors because it allows support for multiple communication protocols, expands performance monitoring capabilities, and leads to a simpler design and debug process. In the Stanford FLASH multiprocessor, flexibility is obtained by requiring all transactions in a node to pass through a programmable node controller, called MAGIC. In this paper, we evaluate the performance costs of flexibility by comparing the performance of FLASH to that of an … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

1996
1996
2007
2007

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 46 publications
(3 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…DE-FG02-03ER25564 writing correct and efficient shared-memory programs without locks, semaphores, or condition variables. Stanford has produced significant research results on this topic [6][7][8] [9], and an ongoing collaboration between Stanford and USC/ISI has produced a prototype optimizing compiler for high-level TCC constructs.…”
Section: Transactional Coherence and Consistency (Tcc)mentioning
confidence: 99%
“…DE-FG02-03ER25564 writing correct and efficient shared-memory programs without locks, semaphores, or condition variables. Stanford has produced significant research results on this topic [6][7][8] [9], and an ongoing collaboration between Stanford and USC/ISI has produced a prototype optimizing compiler for high-level TCC constructs.…”
Section: Transactional Coherence and Consistency (Tcc)mentioning
confidence: 99%
“…Hardware can thus directly initiate a DRAM access once the tag lookup indicates a miss. The latency of the software handler's replacement decision then overlaps the main memory access latency [11]. 1.…”
Section: Access Time Overheadmentioning
confidence: 99%
“…Several recent commercial and research multiprocessor systems [18,15,22] have employed programmable coherence controllers to reduce design time and/or support multiple protocols. However, the flexibility and generality of a programmable controller leads to slower coherence protocol execution, which in turn increases controller occupancy and memory latency [11]. The extent to which this degrades application performance has been the subject of several detailed simulation studies [12,23,19].…”
Section: Programmable Coherence Controllersmentioning
confidence: 99%