2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 2017
DOI: 10.1109/dasc-picom-datacom-cyberscitec.2017.28
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Semantics-Aware Performance Optimization for Data-Intensive Computing

Abstract: We are living in the era of Big Data and witnessing the explosion of data. Given that the limitation of CPU and I/O in a single computer, the mainstream approach to scalability is to distribute computations among a large number of processing nodes in a cluster or cloud. This paradigm gives rise to the term of data-intensive computing, which denotes a data parallel approach to process massive volume of data. Through the efforts of different disciplines, several promising programming models and a few platforms h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 46 publications
0
2
0
Order By: Relevance
“…A survey about data‐intensive scientific workflow managements (SWfMSs) 7 is presented for parallel execution across distributed resource providers, such as cloud infrastructure, and to improve the execution in a multisite cloud. According to Rao and Wang, 8 the emergence of mobile devices is the primary source of using the internet anywhere as needed. With this mobile device emergence in a very high manner, the big data‐intensive applications have also emerged in a vast manner and play a vital role in many people's life.…”
Section: Related Workmentioning
confidence: 99%
“…A survey about data‐intensive scientific workflow managements (SWfMSs) 7 is presented for parallel execution across distributed resource providers, such as cloud infrastructure, and to improve the execution in a multisite cloud. According to Rao and Wang, 8 the emergence of mobile devices is the primary source of using the internet anywhere as needed. With this mobile device emergence in a very high manner, the big data‐intensive applications have also emerged in a vast manner and play a vital role in many people's life.…”
Section: Related Workmentioning
confidence: 99%
“…To break through these dilemmas, a growing number of data-intensive computing frameworks have been proposed, such as MapReduce [13], Apache Hadoop [3], and Spark [45]. Generally, a mainstream approach to gain computing capability and scalability behind these platforms is to distribute data and computations across a cluster of nodes so that a large volume of data can be processed in a parallel and robust manner within a reasonable time [32], [50]. The successes of these frameworks owe to their MapReduce-like programming models, which are further based on data distribution techniques (e.g., Resilient Distributed Dataset (RDD) in Apache Spark [44]), and highorder functions (e.g., map, reduce, f ilter) that can take userdefined functions (UDFs) as arguments.…”
Section: Introductionmentioning
confidence: 99%