Proceedings of the International Symposium on Memory Systems 2017
DOI: 10.1145/3132402.3132427
|View full text |Cite
|
Sign up to set email alerts
|

Identifying the potential of near data processing for apache spark

Abstract: While cluster computing frameworks are continuously evolving to provide real-time data analysis capabilities, Apache Spark has managed to be at the forefront of big data analytics for being a unified framework for both, batch and stream data processing. There is also a renewed interest in Near Data Processing (NDP) due to technological advancement in the last decade. However, it is not known if NDP architectures can improve the performance of big data processing frameworks such as Apache Spark. In this paper, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 28 publications
0
8
0
Order By: Relevance
“…Traditionally workload characterization is done using hardware performance counters present on our current microprocessors. Awan et al [50] use hardware performance counters to characterize the scale-out big data processing workloads into CPU-bound, memory-bound, and I/O-bound applications and propose programmable logic near DRAM and NVRAM. However, the use of hardware performance counters is limited by the impact of micro-architecture features like cache size, issue width etc [51].…”
Section: Microarchitecture-dependent Workload Characterizationmentioning
confidence: 99%
“…Traditionally workload characterization is done using hardware performance counters present on our current microprocessors. Awan et al [50] use hardware performance counters to characterize the scale-out big data processing workloads into CPU-bound, memory-bound, and I/O-bound applications and propose programmable logic near DRAM and NVRAM. However, the use of hardware performance counters is limited by the impact of micro-architecture features like cache size, issue width etc [51].…”
Section: Microarchitecture-dependent Workload Characterizationmentioning
confidence: 99%
“…Unfortunately, the integration of a profiler (such as Perf or Pin) in a compiler or run-time system is still a challenging task [68] due to its dynamic nature. Therefore, current solutions rely on commercial profiling tools, such as Intel VTune [70], to detect the offloading kernels [30], [67], [69]. Additionally, some works [30], [38], [56] use the bandwidth saving as an offloading metric.…”
Section: Discussionmentioning
confidence: 99%
“…NDPproposals have explored many memory architectures. In the literature, SRAM-based NDP proposals mostly aim to insert logic capabilities to the host's cache memories or to the host's memory controllers [17,19,20,21,22,23,24,58,59,103,106]. This work modify the cache hierarchy trying to avoid moving data from the main memory and cache memories to the host's core.…”
Section: B Memory Architectures and Ndpmentioning
confidence: 99%