Performance comparison of Dask and Apache Spark on HPC systems for neuroimaging

Dugré, Mathieu; Hayot‐Sasson, Valérie; Glatard, Tristan

doi:10.1002/cpe.7635

Cited by 2 publications

(1 citation statement)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, adapting MPI-dependent models to Spark involves significant changes to exploit cloud computing's high-performance capabilities effectively [49][50][51]. Efforts to integrate Spark with existing HPC architectures or modify it for enhanced performance are ongoing, with research focusing on extending Spark's utility for complex, high-throughput computing tasks typically handled by MPI [52]. This transition highlights the necessity of adapting high-performance computing paradigms to fit hybrid cloud environments, ensuring efficient data handling and computation.…”

Section: 、Computational Processesmentioning

confidence: 99%

Adaptive Cross-platform Scheduling Framework for NWP in Hybrid Clouds

Ding

2024

Preprint

View full text Add to dashboard Cite

Numerical Weather Prediction (NWP) requires real-time, high-accuracy processing, straining traditional high-performance computing clusters with limited resources, complex operations, and long queue times. Hybrid clouds merge the security of local clusters with the scalability of public clouds, providing a viable solution for high-performance computations. However, it also poses challenges: parallel programming for local clusters is not suitable for the various settings of hybrid clouds; complex parallelization policies increase communication overhead and complicate scheduling; and traditional static resource binding can lead to load imbalance in heterogeneous environments. This paper proposes an adaptive cross-platform scheduling strategy tailored to the characteristics of NWP models. This approach harmonizes the advantages of traditional and cloud-based parallel computing, integrating two distinct parallel programming methodologies and reconfiguring the parallel programming framework of the forecasting models. Experimental results show that the framework effectively improves adaptability and resource utilization, significantly improves computational efficiency and reduces operational overhead in hybrid cloud deployments.

show abstract