Automatic Minimization of Execution Budgets of SPITS Programs in AWS

Okita, Nicholas T.; Coimbra, Tiago A.; Rodamilans, Charles; Tygel, Martin; Borin, Edson

doi:10.1007/978-3-030-41050-6_2

Cited by 2 publications

(2 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Considering the trend of using cloud computing for processing of seismic data, 8‐11 all experiments took place on virtual machines of the Amazon Web Services (AWS) cloud provider. Table 1 shows the instances used, their costs and their characteristics.…”

Section: Performance Analysis Of the Main File Formats In Seismologymentioning

confidence: 99%

See 1 more Smart Citation

High‐performance IO for seismic processing on the cloud

Guimarães

Lacalle

Rodamilans

et al. 2021

Concurrency and Computation

Self Cite

View full text Add to dashboard Cite

Summary Most of the applications in the seismology field rely on the processing of up to hundreds of terabytes of data and their performance is strongly affected by IO operations. In this article, we analyze the main file structures currently used to store seismic data and propose a new intermediate data structure to improve IO performance while still complying with established standards. We show that, throughout a common workflow in seismic data analysis, our IO performance gain greatly surpasses the overhead of translating data to the intermediate structure. This approach enables a speedup of up to 208 times in reading time when using classical standards (e.g., SEG‐Y) and our intermediate structure is up to 1.8 times more efficient than modern formats (e.g., ASDF). Considering cache‐friendly applications, our speedups over the direct use of SEG‐Y reach 8000 times. We also performed a cost analysis on the AWS cloud showing that, in our approach, HDDs can be 1.25 times more cost‐effective than SSDs.

show abstract

Section: Performance Analysis Of the Main File Formats In Seismologymentioning

confidence: 99%

“…The increasing use of cloud computing in seismology 8‐11 also introduces new aspects to be considered. The cloud offers many different storage mechanisms, each one presenting different behaviors, performance characteristics and costs.…”

Section: Introductionmentioning

confidence: 99%