Proceedings of the 49th Annual Design Automation Conference 2012
DOI: 10.1145/2228360.2228431

Approaching the theoretical limits of a mesh NoC with a 16-node chip prototype in 45nm SOI

Abstract: In this paper, we present a case study of our chip prototype of a 16-node 4x4 mesh NoC fabricated in 45nm SOI CMOS that aims to simultaneously optimize energy-latency-throughput for unicasts, multicasts and broadcasts. We first define and analyze the theoretical limits of a mesh NoC in latency, throughput and energy, then describe how we approach these limits through a combination of microarchitecture and circuit techniques. Our 1.1V 1GHz NoC chip achieves 1-cycle router-and-link latency at each hop and energy…

Citations: cited by 81 publications (75 citation statements)
References: 28 publications
“…Existing work reduces latency of NoC routers by either enhancing the classic router [16] or developing simpler micro-architectures. Approaches such as lookaheads [17] add wiring and logic complexity to routers, and increase NoC's area overhead and power consumption. Speculation [7] does not reduce the worst case pipeline delay.…”
Section: Related Work (mentioning)
confidence: 99%
“…To reduce the network latency and buffer read/write power, we implement lookahead (LA) bypassing [19,27]; a lookahead containing control information for a flit is sent to the next router during that flit's ST stage. At the next router, the lookahead performs route-computation and tries to pre-allocate the crossbar for the approaching flit.…”
Section: Main Network Microarchitecture (mentioning)
confidence: 99%
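
The lookahead step quoted above can be illustrated with a minimal Python sketch. It is not taken from the cited work: it assumes XY dimension-ordered routing on a mesh with (x, y) coordinates where y grows toward "N", and the names `xy_output_port`, `lookahead_preallocate`, and `busy_ports` are hypothetical. The lookahead computes the output port ahead of the flit and reports whether the crossbar port could be reserved so the flit can bypass buffering.

```python
def xy_output_port(cur, dst):
    """Route computation under an assumed XY dimension-ordered policy."""
    (cx, cy), (dx, dy) = cur, dst
    if dx > cx:
        return "E"
    if dx < cx:
        return "W"
    if dy > cy:
        return "N"
    if dy < cy:
        return "S"
    return "LOCAL"  # flit has reached its destination router


def lookahead_preallocate(cur, dst, busy_ports):
    """Model a lookahead arriving one cycle ahead of its flit.

    busy_ports is a hypothetical set of output ports already claimed
    for the next cycle. Returns (port, bypass), where bypass is True
    when the crossbar port is free and the flit can skip buffering.
    """
    port = xy_output_port(cur, dst)
    bypass = port not in busy_ports
    return port, bypass


# Usage: the flit heading to (2, 1) arrives at router (2, 0).
print(lookahead_preallocate((2, 0), (2, 1), busy_ports={"E"}))
print(lookahead_preallocate((2, 0), (2, 1), busy_ports={"N"}))
```

When the requested port is already taken, the sketch simply reports that bypassing failed; a real router would then buffer the flit and arbitrate normally.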
“…To alleviate the overhead imposed by the coherence broadcast requests, routers are equipped with single-cycle multicast support [27]. Instead of sending the same requests for each node one by one into the main network, we allow requests to fork through multiple router output ports in the same cycle, thus providing efficient hardware broadcast support.…”
Section: Main Network Microarchitecture (mentioning)
confidence: 99%
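
The forking behaviour described in the statement above can be sketched as follows. This is an illustration only, assuming an XY-tree multicast on a 4x4 mesh with (x, y) coordinates; the function names `xy_port` and `fork_multicast` are hypothetical and not from the cited work. A router partitions the destination set by output port, and every non-empty partition would receive a copy of the flit in the same cycle.

```python
from collections import defaultdict


def xy_port(cur, dst):
    """Output port under an assumed XY dimension-ordered policy."""
    (cx, cy), (dx, dy) = cur, dst
    if dx != cx:
        return "E" if dx > cx else "W"
    if dy != cy:
        return "N" if dy > cy else "S"
    return "LOCAL"


def fork_multicast(cur, dests):
    """Split a multicast destination set across output ports.

    Each key of the result is a port that gets a copy of the flit,
    and each value is the subset of destinations that copy serves.
    """
    branches = defaultdict(set)
    for d in dests:
        branches[xy_port(cur, d)].add(d)
    return dict(branches)


# Usage: broadcast from corner router (0, 0) of a 4x4 mesh.
all_nodes = {(x, y) for x in range(4) for y in range(4)}
for port, subset in sorted(fork_multicast((0, 0), all_nodes).items()):
    print(port, len(subset))
```

Under these assumptions a broadcast injected at a corner forks into three copies (east, north, and local), instead of sixteen separate unicasts entering the network one by one.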