2018
DOI: 10.1145/3273982.3273991
|View full text |Cite
|
Sign up to set email alerts
|

Extreme Datacenter Specialization for Planet-Scale Computing

Abstract: Planet-scale applications are driving the exponential growth of the cloud, and datacenter specialization is the key enabler of this trend, providing order of magnitudes improvements in cost-effectiveness and energy-efficiency. While exascale computing remains a goal for supercomputing, specialized datacenters have emerged and have demonstrated beyond-exascale performance and efficiency in specific domains. This paper generalizes the applications, design methodology, and deployment challenges of the most extrem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 28 publications
0
6
0
Order By: Relevance
“…First, as D. Mackenzie (2021) has noticed, every material political economy, by virtue of its materiality, produces specific ‘ spatial materialities’ (p. 12 original emphasis). In both blockchain and AI, this spatial materiality instantiated itself in clouds understood as large-scale assemblages for ‘planet-scale computing’ ( Xie et al, 2018 ), with Bitcoin ASICs and Google TPUs being the cutting edge of those industries. However, cloud computing is more than just an assemblage of datacentres ( Amoore, 2018 ), but is instead a highly flexible arrangement ( Narayan, 2022 ).…”
Section: Conclusion: Going Full Stackmentioning
confidence: 99%
“…First, as D. Mackenzie (2021) has noticed, every material political economy, by virtue of its materiality, produces specific ‘ spatial materialities’ (p. 12 original emphasis). In both blockchain and AI, this spatial materiality instantiated itself in clouds understood as large-scale assemblages for ‘planet-scale computing’ ( Xie et al, 2018 ), with Bitcoin ASICs and Google TPUs being the cutting edge of those industries. However, cloud computing is more than just an assemblage of datacentres ( Amoore, 2018 ), but is instead a highly flexible arrangement ( Narayan, 2022 ).…”
Section: Conclusion: Going Full Stackmentioning
confidence: 99%
“…Recent work such as ASIC Clouds [62,99] has used design space exploration to optimize directly for datacenter total cost of ownership in the context of bitcoin mining, video transcoding, and machine learning accelerators. FAST extends this by considering return-oninvestment (ROI) and using ROI to demonstrate production feasibility of FAST-generated designs.…”
Section: Related Workmentioning
confidence: 99%
“…Our example ROI calculation assumes a NVIDIA DGX A100 320GB platform baseline containing 8x A100 accelerators with a manufacturer's suggested price (MSRP) of $199,000 [87]. We assume the May 2021 average price of electricity for the US Commercial sector ($0.1084/kWh) from the US Energy Information Administration [1], an accelerator deployment lifetime of 3 years [89], the cost per engineer based on the reported median total compensation for a SWE working in the San Francisco bay area ($240,000) [3] with a 65% salary overhead [99], and all other values from previous work [99]. Since our experimental results assume a sub-10nm process technology, we extrapolate wafer mask and PHY IP costs using exponential scaling as observed in [99].…”
Section: The Economics Of Specialized Acceleratorsmentioning
confidence: 99%
“…Lack of floating-point: PDP hardware, being designed solely for efficient packet processing, lacks floating-point arithmetic support even in more general purpose NPU SmartNICs. Luckily, the embedded ML literature offers many low-precision floatingpoint formats [24,42,49,51] and fixed-point representations [35,54]. The latter set requires only integer arithmetic, which allows us to express and update policy parameters in-NIC.…”
Section: Design and Implementationmentioning
confidence: 99%