2021
DOI: 10.1038/s43588-021-00156-2
|View full text |Cite
|
Sign up to set email alerts
|

Compressing atmospheric data into its real information content

Abstract: Hundreds of petabytes are produced annually at weather and climate forecast centers worldwide. Compression is essential to reduce storage and to facilitate data sharing. Current techniques do not distinguish the real from the false information in data, leaving the level of meaningful precision unassessed. Here we define the bitwise real information content from information theory for the Copernicus Atmospheric Monitoring Service (CAMS). Most variables contain fewer than 7 bits of real information per value and… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 64 publications
0
7
0
Order By: Relevance
“…The f i accumulate around the LBM lattice weights (for D3Q19 w i ∈ { 1 36 , 1 18 , 1 3 }) and the f shifted i accumulate around 0, where floating-point accuracy is best. So for FP32 not only are the trailing bits of the mantissa expected to be nonphysical numerical noise [89], but also some bits of the exponent are entirely unused, meaning one can waive these bits without losing accuracy.…”
Section: Which Range Of Numbers Does the Lbm Use?mentioning
confidence: 99%
See 2 more Smart Citations
“…The f i accumulate around the LBM lattice weights (for D3Q19 w i ∈ { 1 36 , 1 18 , 1 3 }) and the f shifted i accumulate around 0, where floating-point accuracy is best. So for FP32 not only are the trailing bits of the mantissa expected to be nonphysical numerical noise [89], but also some bits of the exponent are entirely unused, meaning one can waive these bits without losing accuracy.…”
Section: Which Range Of Numbers Does the Lbm Use?mentioning
confidence: 99%
“…Using FP32 arithmetic and FP16 DDF storage would be even better, but has not yet been attempted due to concerns about possibly insufficient accuracy. Lower 16-bit precision has already been successfully applied to other fluid solvers [89][90][91] and to a lot of other high-performance computing software [92,93].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A few, recent studies show spatial correlation is intuitively an influential factor of transformation- and prediction-based compressors. In Klöwer et al (2021), the concept of bitwise real information (BIR) is introduced as the mutual information of bits in adjacent grid points. In particular, the stronger the association with neighboring bits, the greater the BIR.…”
Section: Statistical Prediction Of Compression Ratiosmentioning
confidence: 99%
“…The architecture was designed to address the challenge of exascale computing that will allow massive ensemble runs [155]. Ensemble data assimilation with large ensembles and large models requires high performance I/O [156][157][158]. This is due to the large amount of data that needs to be circulated between different constituents of the assimilation system.…”
Section: Discussionmentioning
confidence: 99%