2016
DOI: 10.3390/electronics5040061
|View full text |Cite
|
Sign up to set email alerts
|

A Raspberry Pi Cluster Instrumented for Fine-Grained Power Measurement

Abstract: Abstract:Power consumption has become an increasingly important metric when building large supercomputing clusters. One way to reduce power usage in large clusters is to use low-power embedded processors rather than the more typical high-end server CPUs (central processing units). We investigate various power-related metrics for seventeen different embedded ARM development boards in order to judge the appropriateness of using them in a computing cluster. We then build a custom cluster out of Raspberry Pi board… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(24 citation statements)
references
References 24 publications
0
23
0
1
Order By: Relevance
“…Furthermore, they employ optimized CUDA coding on GPUs to show that the mobile Tegra K1 is 61.8x faster than Cortex-A9, whereas high-end desktop GPUs achieve a huge gain over A9, in the area of 1000x (e.g., 2032x versus Nvidia Tesla K80 with nominal 300 W). We note that, in a broader sense [66,67], the throughput of embedded CPUs could increase even by 10x when we consider higher clock rates (e.g., 2.3 GHz) and/or bigger microarchitectures (e.g., ARM Cortex A-57). However, for the sake of categorization, we focus here on more conservative scenarios, such as Cortex A9/A15 operating below 1 GHz.…”
Section: A Literature Resultsmentioning
confidence: 98%
“…Furthermore, they employ optimized CUDA coding on GPUs to show that the mobile Tegra K1 is 61.8x faster than Cortex-A9, whereas high-end desktop GPUs achieve a huge gain over A9, in the area of 1000x (e.g., 2032x versus Nvidia Tesla K80 with nominal 300 W). We note that, in a broader sense [66,67], the throughput of embedded CPUs could increase even by 10x when we consider higher clock rates (e.g., 2.3 GHz) and/or bigger microarchitectures (e.g., ARM Cortex A-57). However, for the sake of categorization, we focus here on more conservative scenarios, such as Cortex A9/A15 operating below 1 GHz.…”
Section: A Literature Resultsmentioning
confidence: 98%
“…Raspberry Pi 2 Model B was used in the construction of cluster with 25 nodes in [17]. Their goal was to create a cluster for power measurement and educational purposes, proving that the excellent performances can be obtained with low power consumption.…”
Section: Related Workmentioning
confidence: 99%
“…Performance comparison between conventional CPUs to embedded ARM processors integrated in SoC devices such as OMAP 4460 are performed in [12], where a relation that involve CPU frequencies, performance and types of CPUs is shown. The study in [13] shows that the throughput of embedded CPUs could be increased tenfold when clock rates of 2.3 GHz are used. From [14], it is derived that the Cortex-A15 processor at 1.2 GHz is 3-5x faster than the Cortex-A9 of Zynq at 667 Mhz.…”
Section: Introductionmentioning
confidence: 99%