2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) 2014
DOI: 10.1109/ispass.2014.6844483
|View full text |Cite
|
Sign up to set email alerts
|

NDC: Analyzing the impact of 3D-stacked memory+logic devices on MapReduce workloads

Abstract: While Processing-in-Memory has been investigated for decades, it has not been embraced commercially. A number of emerging technologies have renewed interest in this topic. In particular, the emergence of 3D stacking and the imminent release of Micron's Hybrid Memory Cube device have made it more practical to move computation near memory. However, the literature is missing a detailed analysis of a killer application that can leverage a Near Data Computing (NDC) architecture. This paper focuses on in-memory MapR… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
152
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 212 publications
(156 citation statements)
references
References 36 publications
2
152
0
Order By: Relevance
“…al. have proposed adding general-purpose in-order cores to the base die of a stacked DRAM to run Map-Reduce workloads more efficiently [40]. For throughput-oriented computing, the TOP-PIM architecture has been proposed to add programmable GPU compute units inside stacked DRAM to achieve significant energy efficiency improvements compared to the traditional GPU architectures [41].…”
Section: Google's Tensor Processing Unitmentioning
confidence: 99%
“…al. have proposed adding general-purpose in-order cores to the base die of a stacked DRAM to run Map-Reduce workloads more efficiently [40]. For throughput-oriented computing, the TOP-PIM architecture has been proposed to add programmable GPU compute units inside stacked DRAM to achieve significant energy efficiency improvements compared to the traditional GPU architectures [41].…”
Section: Google's Tensor Processing Unitmentioning
confidence: 99%
“…Dynamic directories can reduce the exchange of network control messages, leading to energy conservation. The use of a 3D-stacked memory-logic system for MapReduce workloads is explored in [3]. Such designs involve the use of two separate specialized architectures for Map and Reduce phases.…”
Section: Related Workmentioning
confidence: 99%
“…There is great interest in designing multicore chips that are customized for emerging big-data workloads [1], [2], [3]. In this regard, energy-efficient implementations of the MapReduce framework using single chip multicore platforms can enable new discoveries in big data computing.…”
Section: Introductionmentioning
confidence: 99%
“…There are also several related work demonstrating 3D-stacked DRAM and processing elements integrated together for application acceleration [8], [26], [27], and for general purpose computing [6], [10], [5], [7]. Integrating a high-performance general purpose processor underneath the DRAM raises some thermal issues, on the other hand, energy-efficient accelerators are specific to a certain application.…”
Section: Related Workmentioning
confidence: 99%