2017
DOI: 10.1177/1094342017701278
|View full text |Cite
|
Sign up to set email alerts
|

Task-based programming in COMPSs to converge from HPC to big data

Abstract: Abstract-Task-based programming has proven to be a suitable model for high-performance computing (HPC) applications. Different implementations have been good demonstrators of this fact, and have promoted the acceptance of task-based programming in the OpenMP standard.Furthermore, in recent years, Apache Spark has gained wide popularity in business and research environments as a programming model for addressing emerging Big-Data problems. COMP Superscalar (COMPSs) is a task-based environment that tackles distri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0
3

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 29 publications
(18 citation statements)
references
References 12 publications
0
15
0
3
Order By: Relevance
“…A recent work [34] compared COMPSs performance in Java applications to Apache Spark, using a cluster architecture normally associated with HPC applications (e.g., low-latency networks and shared network disks). In this work, our integration allow us to take COMPSs into a cluster usually adopted in Data Science scenarios, with only traditional networking hardware and with disks distributed among the cluster nodes.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…A recent work [34] compared COMPSs performance in Java applications to Apache Spark, using a cluster architecture normally associated with HPC applications (e.g., low-latency networks and shared network disks). In this work, our integration allow us to take COMPSs into a cluster usually adopted in Data Science scenarios, with only traditional networking hardware and with disks distributed among the cluster nodes.…”
Section: Related Workmentioning
confidence: 99%
“…As previously mentioned in Section 2, a recent work [34] compared COMPSs performance in Java applications to Apache Spark, using a cluster architecture normally associated with HPC applications. Since our integration allows COMPSs to better interface with a cluster architecture more frequently found in Data Science scenarios, in this study we present a comparison of the two systems under those conditions, Table 4 presents a performance comparison between COMPSs and Spark using the applications Grep, Wordcount, KMeans and KNN.…”
Section: Spark Versus Compssmentioning
confidence: 99%
“…COMPSs is used in production at MareNostrum supercomputer and has been used to implement different real world applications, specially in the area of BioInformatics and Computational Genomics [10] [13] [17], Big Data analytics [29] and as building block for several scientific cyber-infrastructures [45] [22]. Other examples of applications developed with COMPSs can be found in [7].…”
Section: Compss Overviewmentioning
confidence: 99%
“…PySpark is a binding to the widely extended framework Spark [17]. A previous paper compares several Big Data algorithms using the native version of both COMPSs 1 and Spark runtimes [18], showing that COMPSs is able to get better or competitive results in comparison to Spark.…”
Section: Introductionmentioning
confidence: 99%