2021
DOI: 10.1109/access.2021.3079139
|View full text |Cite
|
Sign up to set email alerts
|

Towards HPC and Big Data Analytics Convergence: Design and Experimental Evaluation of a HPDA Framework for eScience at Scale

Abstract: Over the last two decades, scientific discovery has increasingly been driven by the large availability of data from a multitude of sources, including high-resolution simulations, observations and instruments, as well as an enormous network of sensors and edge components. In such a dynamic and growing landscape where data continue to expand, advances in Science have become intertwined with the capacity of analysis tools to effectively handle and extract valuable information from this ocean of data. In view of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
3
1
1

Relationship

5
3

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 74 publications
0
8
0
1
Order By: Relevance
“…The front-end server and the computing components are usually deployed on different nodes of the infrastructure. Furthermore, the number of Ophidia computing components can be scaled up, also dynamically, over multiple nodes of the infrastructure to address more intensive data analytics workloads [19]. To this end, the framework supports integration with different HPC scheduling systems.…”
Section: Pyophidiamentioning
confidence: 99%
“…The front-end server and the computing components are usually deployed on different nodes of the infrastructure. Furthermore, the number of Ophidia computing components can be scaled up, also dynamically, over multiple nodes of the infrastructure to address more intensive data analytics workloads [19]. To this end, the framework supports integration with different HPC scheduling systems.…”
Section: Pyophidiamentioning
confidence: 99%
“…In this respect, the climate community has been continuously pushing the boundaries to deploy and run model simulations at the highest resolution possible, exploiting cutting-edge supercomputing infrastructures [7]. The resulting output consists of large, complex and heterogeneous datasets that require proper solutions for management and knowledge extraction [8], and which can take advantage of data-oriented approaches from DA and AI fields.…”
Section: Climate Modellingmentioning
confidence: 99%
“…Indeed, climate scientists, meteorological agencies and policy decision makers need to process and extract meaningful information from these huge data sets in a cost‐effective manner and in a reasonable amount of time (Sebestyén et al., 2021). In this context, High Performance Data analytics systems can address some of the issues and provide support for descriptive/statistical analysis of this large‐scale data (Elia et al., 2021). Nevertheless, in the last few years Machine Learning (ML) and Deep Learning (DL) algorithms became popular as data‐driven paradigms for supporting feature extraction from the vast amounts of scientific data currently available (Hey et al., 2020).…”
Section: Introductionmentioning
confidence: 99%