Voltage, Throughput, Power, Reliability, and Multicore Scaling

Xia, Fei; Rafiev, Ashur; Aalsaud, Ali; Al-hayanni, Mohammed A. Noaman; Davis, James J.; Levine, Joshua M.; Mokhov, Andrey; Romanovsky, Alexander; Shafik, Rishad; Yakovlev, Alex; Yang, Sheng

doi:10.1109/mc.2017.3001246

Cited by 13 publications

(7 citation statements)

References 12 publications

“…It can be verified that (29) transforms into Sun-Ni's model (10) by substituting f_1 with 1 − f and f_n with f ⋅ g(n), and f_j = 0, ∀ 1 < j < n. For g(n) = n the model further transforms into Gustafson's (7), and for g(n) = 1 it becomes classical Amdahl's law (4). Other related models, for instance that extending models similar to Sun-Ni's over Hill-Marty asymmetric heterogeneity [55], are also covered by the multi-fraction model with similar arguments.…”

Section: Parallelism and P-fractions (mentioning)

confidence: 95%
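
The reductions described in the quotation above can be checked numerically. The sketch below assumes the commonly stated form of Sun-Ni's model, S(n) = ((1 − f) + f·g(n)) / ((1 − f) + f·g(n)/n), which may differ in notation from equation (10) of the cited survey. The function names are illustrative and do not come from the source.

```python
def sun_ni_speedup(f, n, g=lambda n: n):
    """Sun-Ni speedup for parallel fraction f on n cores.

    g(n) models how the parallel part of the workload grows with n
    (assumed textbook form; not copied from the cited survey).
    """
    if not 0.0 <= f <= 1.0:
        raise ValueError("f must lie in [0, 1]")
    if n < 1:
        raise ValueError("n must be at least 1")
    scaled = f * g(n)  # parallel workload after scaling
    return ((1.0 - f) + scaled) / ((1.0 - f) + scaled / n)


def amdahl_speedup(f, n):
    """Amdahl's law: fixed workload."""
    return 1.0 / ((1.0 - f) + f / n)


def gustafson_speedup(f, n):
    """Gustafson's model: parallel workload grows linearly with n."""
    return (1.0 - f) + f * n
```

With g(n) = 1 the Sun-Ni expression should agree with `amdahl_speedup`, and with g(n) = n it should agree with `gustafson_speedup`, up to floating-point rounding.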

“…The classical method for modelling the speedup of workload processing caused by some measure of improving the computation capabilities is known as Amdahl's law, which developed from observations presented by Amdahl in 1967 [1]. Amdahl did not provide a mathematical formula for this law, which was later formulated based on his verbal arguments. Given the context of this paper, which is about the parallelisation of workloads on M/MCP systems, 'improvement of computation capabilities' generally means the incorporation of multiple processing units (to be called 'cores' in this paper) to improve the speed of workload execution, unless otherwise noted.…”

[Comparison table interleaved with the quotation in the source; column headers not preserved. Rows as extracted:]

[2,42] | no | p-fraction | yes | no | no | no | no
[3] | no | p-fraction | yes | yes | yes | yes | no
[5] | load balancing and scheduling | p-fraction | yes | yes | yes | yes | no
[7] | no | p-fraction | yes | yes | yes | no | no
[13] | no | parallelism | yes | yes | no | no | no
[14] | load balancing and scheduling | parallelism | yes | yes | no | no | no
[15] | no | p-fraction | yes | yes | yes | yes | no
[17] | synchronisation and communication | p-fraction | yes | yes | no | no | no
[18] | no | p-fraction | yes | no | no | no | no
[4,43] | no | p-fraction | yes | no | no | no | no
[19,20,44] | no | p-fraction | yes | no | no | no | no
[22] | no | p-fraction | yes | no | no | no | no
[24] | no | multi p-fraction and parallelism | yes | yes | no | no | no
[25] | no | p-fraction | yes | no | no | no | no
[26] | time of parallel tasks | no | no | no | no | no | no
[27] | no | p-fraction | yes | no | no | no | no
[28] | no | p-fraction | yes | no | no | no | no
[29] | no | no | yes | yes | no | no | no
[38] | no (remainder of row missing in source)
[68] | no | p-fraction | yes | no | no | no | no
[69] | run-time | no | no | yes | no | no | yes
[70] | run-time | no | no | yes | yes | yes | yes

Section: Amdahl's Law and Gustafson's Model (mentioning)

confidence: 99%

“… Speedup versus the number of cores for (a) Amdahl's law, (b) Gustafson's model, (c) Sun–Ni's model with g(n) = n^{3/2}. Figure adapted from [7]…”

Section: Non‐processing Elements and Overheads (mentioning)

confidence: 99%

“…On performance, it is well known that the parallelisation of workloads may bring speedup [1][2][3][4][5]. On energy efficiency, with the advent of such hardware techniques as dynamic voltage and frequency scaling (DVFS), it is possible to trade an increase in the number of processing units for a reduction of energy consumption without affecting performance [6][7][8][9].…”

Section: Introduction (mentioning)

confidence: 99%
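
The trade-off in the quotation above can be illustrated with a common first-order argument. The sketch below is not taken from the cited papers and rests on strong assumptions: dynamic power follows P = C_eff · V² · f, supply voltage can be scaled in proportion to frequency, static power is ignored, and the workload is perfectly parallel. All names and parameters are hypothetical.

```python
def dynamic_power(c_eff, voltage, freq_hz):
    """Switching power of one core: P = C_eff * V^2 * f (static power ignored)."""
    return c_eff * voltage ** 2 * freq_hz


def scaled_out_power(c_eff, voltage, freq_hz, n_cores):
    """Total power of n cores, each run at 1/n of the frequency and voltage.

    Assumes voltage can scale linearly with frequency, which real silicon
    only permits within a limited range.
    """
    per_core = dynamic_power(c_eff, voltage / n_cores, freq_hz / n_cores)
    return n_cores * per_core


def aggregate_throughput(freq_hz, n_cores):
    """Total cycles per second of n cores at f/n each (perfect parallelism assumed)."""
    return n_cores * (freq_hz / n_cores)
```

Under these assumptions the aggregate throughput is unchanged while total dynamic power falls by a factor of n². In practice the gain is smaller, because voltage has a lower bound, leakage does not scale this way, and workloads have a serial fraction.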

Amdahl's law in the context of heterogeneous many‐core systems – a survey

Al-hayanni, Xia, Rafiev, et al. 2020. IET Computers & Digital Techniques.

Self Cite

For over 50 years, Amdahl's Law has been the hallmark model for reasoning about performance bounds for homogeneous parallel computing resources. As heterogeneous, many-core parallel resources continue to permeate into the modern server and embedded domains, there has been growing interest in promulgating realistic extensions and assumptions in keeping with newer use cases. This study aims to provide a comprehensive review of the purviews and insights provided by the extensive body of work related to Amdahl's law to date, focusing on computation speedup. The authors show that a significant portion of these studies has looked into analysing the scalability of the model considering both workload and system heterogeneity in real-world applications. The focus has been to improve the definition and semantic power of the two key parameters in the original model: the parallel fraction (f) and the computation capability improvement index (n). More recently, researchers have shown normal-form and multi-fraction extensions that can account for wider ranges of heterogeneity, validated on many-core systems running realistic workloads. Speedup models from Amdahl's law onwards have seen a wide range of uses, such as the optimisation of system execution, and these uses are even more important with the advent of the heterogeneous many-core era.

“…Run-time management is a set of methods for managing hardware/software knobs and monitors under variable workloads to improve system operation, for example by optimizing some chosen metric in the performance/energy trade-off [7]. Existing run-time algorithms react to workload change by dynamically scaling voltage/frequency (DVFS) [8], in combination with task mapping and core allocations [9].…”

Section: Introduction (mentioning)

confidence: 99%

PARMA: Parallelization-Aware Run-Time Management for Energy-Efficient Many-Core Systems

Al-hayanni, Rafiev, Xia, et al. 2020. IEEE Trans. Comput.

Self Cite

Performance and energy efficiency considerations have shifted computing paradigms from single-core to many-core architectures. At the same time, traditional speedup models such as Amdahl's Law face challenges in the run-time reasoning for system performance and energy efficiency, because these models typically assume limited variations of the parallel fraction. Moreover, the parallel fraction, which varies dynamically in workloads, is generally unknown at run-time without application-level instrumentation. This paper describes novel performance/energy trade-off models based on realistic architectural considerations, which describe the parallel fraction and speedup as functions of performance counter values available in modern processors, removing the need for application-level instrumentation. These are then used to develop a Parallelization-Aware Run-time Management (PARMA) approach. PARMA aims at controlling core allocations and operating voltage/frequency points for energy efficiency, according to the varying workload parallel fractions. The efficacy of our models and the PARMA approach is extensively validated using a number of PARSEC benchmark applications, involving two performance/energy trade-off metrics: energy-delay-product (EDP), typically used in high-performance applications, and energy per instruction (EPI), suitable for energy-aware applications. Improvements of up to 48 percent in EDP and up to 68 percent in EPI have been observed using the PARMA approach compared with parallelization-agnostic methods.
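
The two metrics named in the abstract have standard definitions: EDP is energy multiplied by execution time, and EPI is energy divided by the number of instructions executed. The sketch below shows how the two metrics can favour different operating points. The `Run` record and the numbers in `EXAMPLE_RUNS` are hypothetical, chosen only for illustration, and are not measurements from the PARMA paper.

```python
from typing import NamedTuple


class Run(NamedTuple):
    """One measured operating point (hypothetical record layout)."""
    cores: int
    freq_mhz: int
    energy_j: float
    delay_s: float
    instructions: int


def edp(run):
    """Energy-delay product in joule-seconds (lower is better)."""
    return run.energy_j * run.delay_s


def epi(run):
    """Energy per instruction in joules (lower is better)."""
    return run.energy_j / run.instructions


def best(runs, metric):
    """Return the operating point that minimises the given metric."""
    return min(runs, key=metric)


# Made-up values for illustration only.
EXAMPLE_RUNS = [
    Run(cores=1, freq_mhz=2000, energy_j=10.0, delay_s=4.0, instructions=1_000_000),
    Run(cores=4, freq_mhz=1000, energy_j=6.0, delay_s=2.5, instructions=1_000_000),
    Run(cores=4, freq_mhz=2000, energy_j=9.0, delay_s=1.5, instructions=1_000_000),
]
```

On this made-up data, `best(EXAMPLE_RUNS, edp)` selects the four-core 2000 MHz point, while `best(EXAMPLE_RUNS, epi)` selects the four-core 1000 MHz point, which is consistent with the abstract's distinction between performance-oriented and energy-oriented metrics.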
