2016
DOI: 10.1016/j.procs.2016.05.297
|View full text |Cite
|
Sign up to set email alerts
|

Modeling and Implementation of an Asynchronous Approach to Integrating HPC and Big Data Analysis 1

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
19
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(19 citation statements)
references
References 11 publications
0
19
0
Order By: Relevance
“…Figure 14.4 illustrates the traditional method, which is the simplest method without optimizations (next subsection will show an optimized version of the traditional method) [8]. The traditional method works as follows: the compute processes compute results and write computed results to disks, followed by the analysis processes reading results and then analyzing the results.…”
Section: The Traditional Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Figure 14.4 illustrates the traditional method, which is the simplest method without optimizations (next subsection will show an optimized version of the traditional method) [8]. The traditional method works as follows: the compute processes compute results and write computed results to disks, followed by the analysis processes reading results and then analyzing the results.…”
Section: The Traditional Methodsmentioning
confidence: 99%
“…The other improvement is that the user input is divided into a number of fine-grain blocks and written to disks asynchronously. Figure 14.5 shows this improved version of the traditional method [8]. We can see that the output stage is now overlapped with the computation stage so that the output time might be hidden by the computation time.…”
Section: Improved Version Of the Traditional Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another research area is focused on developing APIs to speed up data access for analytics use cases [17,25,37]. GLEAN [37] is a framework that provides an infrastructure for accelerating I/O interfacing with running simulations for co-analysis and supports in-situ analysis.…”
Section: Related Workmentioning
confidence: 99%
“…With the rise in popularity of using ML to analyze simulation data and automate scientific workflows [14,15,20,21,23,24], there is growing interest in exploiting Spark for HPC simulation data. Recent work on using Spark varies from implementing traditional analysis pipelines [26,33,34] to developing APIs to speed up data access [17,25,37].…”
mentioning
confidence: 99%