Energy Efficient Run-Time Incremental Mapping for 3-D Networks-on-Chip

Wang, Xiaohang; Liu, Peng; Yang, Mei; Palesi, Maurizio; Jiang, Yingtao; Huang, Michael C.

doi:10.1007/s11390-013-1312-x

Cited by 23 publications

(13 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The floorplan of each tile in NoC is shown as in Fig. 11, as in [37]. The dimension of the Alpha core is adopted from [?…”

Section: Methodsmentioning

confidence: 99%

“…The configuration of the network-on-chip is listed in Table 3. The many-core system floorplanning can be found in [37]. The temperature threshold is 60 o C. We compare our approach with the following two runtime thermal-aware mapping algorithms that aim to dark silicon era, (1) DsRem [24], where the cores on/off patterning are identified followed by tasks mapped to active cores, and (2) PAT [22], where a core region including inactive cores is found for each application.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems

Wang

Singh

et al. 2016

2016 Tenth IEEE/ACM International Symposium on Networks-on-Chip (NOCS)

View full text Add to dashboard Cite

Abstract-All the cores of a many-core chip cannot be active at the same time, due to reasons like low CPU utilization in server systems and limited power budget in dark silicon era. These free cores (referred to as bubbles) can be placed near active cores for heat dissipation so that the active cores can run at a higher frequency level, boosting the performance of applications that run on active cores. Budgeting inactive cores (bubbles) to applications to boost performance has the following three challenges. First, the number of bubbles varies due to open workloads. Second, communication distance increases when a bubble is inserted between two communicating tasks (a task is a thread or process of a parallel application), leading to performance degradation. Third, budgeting too many bubbles as coolers to running applications leads to insufficient cores for future applications. In order to address these challenges, in this paper, a bubble budgeting scheme is proposed to budget free cores to each application so as to optimize the throughput of the whole system. Throughput of the system depends on the execution time of each application and the waiting time incurred for newly arrived applications. Essentially, the proposed algorithm determines the number and locations of bubbles to optimize the performance and waiting time of each application, followed by tasks of each application being mapped to a core region. A Rollout algorithm is used to budget power to the cores as the last step. Experiments show that our approach achieves 50% higher throughput when compared to state-of-the-art thermal-aware runtime task mapping approaches. The runtime overhead of the proposed algorithm is in the order of 1M cycles, making it an efficient runtime task management method for large-scale many-core systems.

show abstract

“…The floorplan of each tile in NoC is shown as in Fig. 11, as in [37]. The dimension of the Alpha core is adopted from [?…”

Section: Methodsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems

Wang

Singh

et al. 2016

2016 Tenth IEEE/ACM International Symposium on Networks-on-Chip (NOCS)

View full text Add to dashboard Cite

show abstract

“…Table 4 lists the mixes of the benchmarks selected from PARSEC and SPLASH-2. The thermal parameters are adopted from [35]. Each tile is composed by a processor core, L2 cache bank, and a router.…”

Section: Methodsmentioning

confidence: 99%

An efficient runtime power allocation scheme for many-core systems inspired from auction theory

et al. 2015

Self Cite

View full text Add to dashboard Cite

“…The evolution of SoC design to the third dimension offers a lot of opportunities such as integration of inhomogeneous cores which results in several challenges including optimal inhomogeneous NoC topologies, router architectures and application mapping techniques [9], [10], [11], [12], [13]. Various 3D NoC topologies are presented and evaluated in [14], [15], [16], [17] where homogeneous 3D routers are employed in each architecture.…”

Section: Related Workmentioning

confidence: 99%

“…However, these algorithms have very high computational complexities. Wang et al [46] proposed a mapping algorithm for 3D NoCs based on run-time incremental mapping technique [47]. Here, the algorithm tries to map applications to convex regions while utilizing as many vertical links as possible in the mapping process.…”

Section: Accepted Manuscriptmentioning

confidence: 99%

Energy and performance-aware application mapping for inhomogeneous 3D networks-on-chip

Agyeman

Ahmadinia

Bagherzadeh

2018

Journal of Systems Architecture

View full text Add to dashboard Cite

Three dimensional Networks-on-Chip (3D NoCs) have evolved as an ideal solution to the communication demands and complexity of future high density many core architectures. However, the design practicality of 3D NoCs faces several challenges such as thermal issues, high power consumption and area overhead of 3D routers as well as high complexity and cost of vertical link implementation. To mitigate the performance and manufacturing cost of 3D NoCs, inhomogeneous architectures have emerged to combine 2D and 3D routers in 3D NoCs producing lower area and energy consumption while maintaining the performance of homogeneous 3D NoCs. Due to the limited number of vertical links, application mapping on inhomogeneous 3D NoCs can be complex. However, application mapping has a great impact on the performance and energy consumption of NoCs. This paper presents an energy and performance aware application mapping algorithm for inhomogeneous 3D NoCs. The algorithm has been evaluated with various realistic traffic patterns and compared with existing mapping algorithms. Experimental results show NoCs mapped with the proposed algorithm have lower energy consumption and significant reduction in packet delays compared to the existing algorithms and comparable average packet latency with Branch-and-Bound.

show abstract

Energy Efficient Run-Time Incremental Mapping for 3-D Networks-on-Chip

Cited by 23 publications

References 34 publications

Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems

Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems

An efficient runtime power allocation scheme for many-core systems inspired from auction theory

Energy and performance-aware application mapping for inhomogeneous 3D networks-on-chip

Contact Info

Product

Resources

About