Weather and climate forecasting with neural networks: using general circulation models (GCMs) with different complexity as a study ground

Scher, Sebastian; Messori, Gabriele

doi:10.5194/gmd-12-2797-2019

Cited by 128 publications

(107 citation statements)

References 24 publications

Supporting

Mentioning

104

Contrasting

Order By: Relevance

“…in favor of the fully convolutional architecture. Furthermore the success of this type of architecture applied to gridded atmospheric fields (Larraondo et al, 2019;Scher, 2018;Scher & Messori, 2019) further validates its use. We performed limited validation of the CNNs using varying numbers of convolutional layers and convolutional filter stencil sizes (and dilation) before obtaining this architecture.…”

Section: Algorithmsmentioning

confidence: 81%

“…However, the "weather" generated by their GCM was idealized in comparison to the real world, because processes like chaotic upscale error growth and factors like seasonality were not included. An extension of this work, which applied CNNs to GCM output including seasons and at higher horizontal resolution, showed a more complicated story (Scher & Messori, 2019). The CNNs performed slightly worse on model simulations including seasons, while their performance was more severely degraded on higher-resolution input, albeit more due to the complexity of the resolved weather than the computational cost of increasing the number of grid points.…”

Section: Introductionmentioning

confidence: 85%

“…Adding more inputs generally requires more training data, and it remains to be seen how much improvement can be obtained when adding additional fields while training with reliable reanalysis data sets that only go back about 60 years. One indication that this data record could be enough for substantial progress is provided by Scher and Messori (2019) who suggested that there is little benefit to using more than 100 years of GCM data to train CNNs to forecast the GCM analog atmospheres. Important improvements might also be obtained by refining the DLWP architecture.…”

Section: Discussionmentioning

confidence: 99%

“…Here, we approach ML weather prediction head-on by developing models that use CNNs to learn to predict 500-hPa geopotential heights and 700-to 300-hPa thickness from 24 years of atmospheric reanalysis on a 2.5 • grid over the Northern Hemisphere. We improve upon the prior work of Dueben and Bauer (2018) by using more advanced NN architectures, more years of reanalysis data, and higher resolution (albeit over the Northern Hemisphere only), while in contrast to Scher (2018) and Scher and Messori (2019), we predict observed weather instead of idealized GCM forecast states. Not surprisingly, our ML models do not compare in forecast accuracy to current operational NWP models, which have been refined by decades of research, operate at much higher resolution, and use far more data to describe the initial condition for each forecast.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data

Weyn

Durran

Caruana

2019

J Adv Model Earth Syst

238

166

View full text Add to dashboard Cite

We develop elementary weather prediction models using deep convolutional neural networks (CNNs) trained on past weather data to forecast one or two fundamental meteorological fields on a Northern Hemisphere grid with no explicit knowledge about physical processes. At forecast lead times up to 3 days, CNNs trained to predict only 500‐hPa geopotential height easily outperform persistence, climatology, and the dynamics‐based barotropic vorticity model, but do not beat an operational full‐physics weather prediction model. These CNNs are capable of forecasting significant changes in the intensity of weather systems, which is notable because this is beyond the capability of the fundamental dynamical equation that relies solely on 500‐hPa data, the barotropic vorticity equation. Modest improvements to the CNN forecasts can be made by adding 700‐ to 300‐hPa thickness to the input data. Our best performing CNN does a good job of capturing the climatology and annual variability of 500‐hPa heights and is capable of forecasting realistic atmospheric states at lead times of 14 days. Although our simple models do not perform better than an operational weather model, machine learning warrants further exploration as a weather forecasting tool; in particular, the potential efficiency of CNNs might make them attractive for ensemble forecasting.

show abstract

Section: Algorithmsmentioning

confidence: 81%

Section: Introductionmentioning

confidence: 85%

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data

Weyn

Durran

Caruana

2019

J Adv Model Earth Syst

238

166

View full text Add to dashboard Cite

show abstract

“…It is increasingly common in meteorology to use machine learning approaches for identifying patterns in the atmosphere using large amounts of historical data (Dueben & Bauer, ; Scher & Messori, ; Ukkonen & Mäkelä, ; Weyn et al, ). This approach, of extracting the underlying physical relationships in the atmosphere from data, opens an opportunity to explore new algorithms that optimize the output based on different verification metrics.…”

Section: Introductionmentioning

confidence: 99%

Optimization of Deep Learning Precipitation Models Using Categorical Binary Metrics

Larraondo

Renzullo

Dijk

et al. 2020

J Adv Model Earth Syst

View full text Add to dashboard Cite

This work introduces a methodology for optimizing neural network models using a combination of continuous and categorical binary indices in the context of precipitation forecasting. Probability of detection and false alarm rate are popular metrics used in the verification of precipitation models. However, machine learning models trained using gradient descent cannot be optimized based on these metrics, as they are not differentiable. We propose an alternative formulation for these categorical indices that are differentiable and we demonstrate how they can be used to optimize the skill of precipitation neural network models defined as a multiobjective optimization problem. To our knowledge, this is the first proposal of a methodology for optimizing weather neural network models based on categorical indices. Plain Language Summary Deep neural networks have recently demonstrated great versatilityand an unprecedented capacity to model complex problems. In weather modeling, these algorithms have been applied to solve different problems. This is a promising area of research, given the availability of large volumes of weather data and increasingly powerful computers. Neural network models can learn to solve problems based on a metric, which the model tries to optimize. However, the quality of weather models is measured using a large variety of metrics, which can be a challenge when choosing which metric the model should optimize. In the case of precipitation, categorical binary metrics are a popular choice to assess the quality of a model. These metrics reduce precipitation to a "yes" or "no" event, and the results of the predicting model can be compared with the actual observations. This method is simple, yet powerful and a large number of indices and statistics have been developed to assess different aspects of the quality of precipitation models. As precipitation models are commonly assessed using these categorical binary metrics, it would be very convenient to optimize models based on them. Unfortunately, the mathematical nature of these metrics makes them unsuitable for optimizing deep learning models. In this work we present an alternative formulation for these categorical binary indices which can be used to train models. We demonstrate how a deep learning model can be trained to generate better quality precipitation data.

show abstract

Application of Machine Learning to Parameterization Emulation and Development

Krasnopolsky,

Belochitski

2023

Fast Processes in Large‐Scale Atmospheric Models

View full text Add to dashboard Cite

Weather and climate forecasting with neural networks: using general circulation models (GCMs) with different complexity as a study ground

Cited by 128 publications

References 24 publications

Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data

Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data

Optimization of Deep Learning Precipitation Models Using Categorical Binary Metrics

Application of Machine Learning to Parameterization Emulation and Development

Contact Info

Product

Resources

About