2007
DOI: 10.1535/itj.1103.01
|View full text |Cite
|
Sign up to set email alerts
|

Integration Challenges and Tradeoffs for Terascale Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
46
0

Year Published

2008
2008
2013
2013

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 66 publications
(46 citation statements)
references
References 0 publications
0
46
0
Order By: Relevance
“…This led to the concept of tiled architecture, characterized by regular structures of homogeneous tiles, each one consisting of a processing core, a cache memory and a router. Further research in this area inspired designs with heterogeneous tiles, preserving the regularity of the structure, but introducing several classes of tiles [1], [7], [8]. Such systems may include some specialized processors (e.g., graphics, DSP) or different implementations of the same architecture (e.g., in-order/outof-order, multi-threading) with varied power-performance trade-offs.…”
Section: A Tiled Cmp Architecturementioning
confidence: 99%
“…This led to the concept of tiled architecture, characterized by regular structures of homogeneous tiles, each one consisting of a processing core, a cache memory and a router. Further research in this area inspired designs with heterogeneous tiles, preserving the regularity of the structure, but introducing several classes of tiles [1], [7], [8]. Such systems may include some specialized processors (e.g., graphics, DSP) or different implementations of the same architecture (e.g., in-order/outof-order, multi-threading) with varied power-performance trade-offs.…”
Section: A Tiled Cmp Architecturementioning
confidence: 99%
“…Azimi et al [2] affirm that it is necessary to find a way to keep the off-die bandwidth manageable in system architectures with tradeoffs among cost, power, and performance. Moreover, in a hardware context, the system must offer flexibility with high-bandwidth, low-latency, and power-efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…For example, the dual-core IBM Power6 (Le et al, 2007) and the eight-core Sun UltraSPARC T2 (Shah et al, 2007) have a relatively small number of cores, which are typically connected through a shared medium, i.e., a bus or a crossbar. However, CMP architectures that integrate tens of processor cores (usually known as many-core CMPs) are expected for the near future, after Intel recently unveiled the 80-core Polaris prototype (Azimi et al, 2007). Since the area required by a shared interconnect becomes impractical as the number of cores grows (Kumar et al, 2005), it seems that the processing cores of future CMPs will be connected by means of unordered point-to-point networks.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, most CMP systems provide programmers with the intuitive sharedmemory model, which requires efficient support for cache coherence. Although a great deal of attention was devoted to scalable cache coherence protocols in the last decades in the context of shared-memory multiprocessors, the technological parameters and constraints entailed by many-core CMPs demand new solutions to the cache coherence problem (Bosschere et al, 2007;Azimi et al, 2007). In this chapter, we focus on three main design goals for cache coherence protocols aimed at being employed in many-core CMPs: performance, on-chip network traffic, and area requirements.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation