2023
DOI: 10.51408/1963-0100
|View full text |Cite
|
Sign up to set email alerts
|

Data Compression-Aware Performance Analysis of Dask and Spark for Earth Observation Data Processing

Abstract: High-performance computing is a good choice for handling Big Earth Observation data, allowing the processing of the data in a distributed and performance-efficient way using in-memory computing frameworks. The data compression technique reduces the amount of storage and network transfer time and improves processing performance. The article aims to investigate the effectiveness of widely used distributed data processing frameworks in conjunction with lossless data compression techniques, to find the optimal com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…This examination compares the effects of the Dask and Spark environments on the speed of data processing. Study shows [13] that Dask and Spark both offer comparable data processing performance. However, combining the Dask environment with the Zstandard compression technique yields the best performance results.…”
Section: Decision-makingmentioning
confidence: 89%
“…This examination compares the effects of the Dask and Spark environments on the speed of data processing. Study shows [13] that Dask and Spark both offer comparable data processing performance. However, combining the Dask environment with the Zstandard compression technique yields the best performance results.…”
Section: Decision-makingmentioning
confidence: 89%