Machine Learning for Model Error Inference and Correction

Bonavita, Massimo; Laloyaux, Patrick

doi:10.1029/2020ms002232

Cited by 90 publications

(86 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Weak-constraint DA [62] is similar, in that it does not improve the forward model, but estimates a spatial field of model errors. ML could be equally applicable to learning this kind of model error [102]. However, in weak-constraint DA, it can be hard to separate these errors from errors in the state, if they occur on similar spatial scales [63].…”

Section: Learning New Earth System Physicsmentioning

confidence: 99%

Learning earth system models from observations: machine learning or data assimilation?

Geer

2021

Phil. Trans. R. Soc. A.

119

View full text Add to dashboard Cite

Recent progress in machine learning (ML) inspires the idea of improving (or learning) earth system models directly from the observations. Earth sciences already use data assimilation (DA), which underpins decades of progress in weather forecasting. DA and ML have many similarities: they are both inverse methods that can be united under a Bayesian (probabilistic) framework. ML could benefit from approaches used in DA, which has evolved to deal with real observations—these are uncertain, sparsely sampled, and only indirectly sensitive to the processes of interest. DA could also become more like ML and start learning improved models of the earth system, using parameter estimation, or by directly incorporating machine-learnable models. DA follows the Bayesian approach more exactly in terms of representing uncertainty, and in retaining existing physical knowledge, which helps to better constrain the learnt aspects of models. This article makes equivalences between DA and ML in the unifying framework of Bayesian networks. These help illustrate the equivalences between four-dimensional variational (4D-Var) DA and a recurrent neural network (RNN), for example. More broadly, Bayesian networks are graphical representations of the knowledge and processes embodied in earth system models, giving a framework for organising modelling components and knowledge, whether coming from physical equations or learnt from observations. Their full Bayesian solution is not computationally feasible but these networks can be solved with approximate methods already used in DA and ML, so they could provide a practical framework for the unification of the two. Development of all these approaches could address the grand challenge of making better use of observations to improve physical models of earth system processes. This article is part of the theme issue ‘Machine learning for weather and climate modelling’.

show abstract

Section: Learning New Earth System Physicsmentioning

confidence: 99%

Learning earth system models from observations: machine learning or data assimilation?

Geer

2021

Phil. Trans. R. Soc. A.

119

View full text Add to dashboard Cite

show abstract

“…Researchers used DL to estimate ground-level PM2.5 or PM10 levels by using satellite observations and station measurements (Li et al, 2017;Shen et al, 2018;Tang et al, 2018). DL also helps improve the accuracy of weather forecasting, which is a long-standing challenge in atmospheric science (Bonavita & Laloyaux, 2020;Scher & Messori, 2021). The tracks of typhoons were predicted with a GAN based on satellite images (Rüttgers et al, 2019).…”

Section: Atmospheric Sciencementioning

confidence: 99%

“…With multiple realizations of dropout, the results are collected, and the variance is computed as the uncertainty. DL with uncertainty estimation in inference is reported in areas such as volcano-seismic monitoring (Bueno et al, 2019), geomagnetic storm forecasting (Tasistro-Hart et al, 2020), weather forecasting (Scher & Messori, 2021;Bonavita & Laloyaux, 2020), soil moisture predictions (Fang, Kifer, et al, 2020) and earthquake locations estimation (Mousavi & Beroza, 2020b).…”

Section: Uncertainty Estimationmentioning

confidence: 99%

Deep Learning for Geophysics: Current and Future Trends

2021

Reviews of Geophysics

281

View full text Add to dashboard Cite

show abstract

“…With spatially dense and noise-free data, this approach has been based on sparse regression [9], echo state networks [10,11], recurrent neural networks (NN) [12], residual neural network [13] or convolutional neural networks [14,15]. The challenging problem of partial and/or noisy observations has been addressed using dedicated NN architecture [16] or in combination with DA methods [17][18][19][20][21].…”

Section: Introductionmentioning

confidence: 99%

Combining data assimilation and machine learning to infer unresolved scale parametrization

Brajard

Carrassi

Bocquet

et al. 2021

Phil. Trans. R. Soc. A.

106

View full text Add to dashboard Cite

In recent years, machine learning (ML) has been proposed to devise data-driven parametrizations of unresolved processes in dynamical numerical models. In most cases, the ML training leverages high-resolution simulations to provide a dense, noiseless target state. Our goal is to go beyond the use of high-resolution simulations and train ML-based parametrization using direct data, in the realistic scenario of noisy and sparse observations. The algorithm proposed in this work is a two-step process. First, data assimilation (DA) techniques are applied to estimate the full state of the system from a truncated model. The unresolved part of the truncated model is viewed as a model error in the DA system. In a second step, ML is used to emulate the unresolved part, a predictor of model error given the state of the system. Finally, the ML-based parametrization model is added to the physical core truncated model to produce a hybrid model. The DA component of the proposed method relies on an ensemble Kalman filter while the ML parametrization is represented by a neural network. The approach is applied to the two-scale Lorenz model and to MAOOAM, a reduced-order coupled ocean-atmosphere model. We show that in both cases, the hybrid model yields forecasts with better skill than the truncated model. Moreover, the attractor of the system is significantly better represented by the hybrid model than by the truncated model. This article is part of the theme issue ‘Machine learning for weather and climate modelling’.

show abstract

Machine Learning for Model Error Inference and Correction

Cited by 90 publications

References 37 publications

Learning earth system models from observations: machine learning or data assimilation?

Learning earth system models from observations: machine learning or data assimilation?

Deep Learning for Geophysics: Current and Future Trends

Combining data assimilation and machine learning to infer unresolved scale parametrization

Contact Info

Product

Resources

About