Observation operators (OOs) are a central component of any data assimilation system. As they project the state variables of a numerical model into the space of the observations, they also provide an ideal opportunity to correct for effects that are not or not sufficiently described by the model. In such cases a dynamical OO, an OO that interfaces to a secondary and more specialised model, often provides the best results. However, given the large number of observations to be assimilated in a typical atmospheric or oceanographic model, the computational resources needed for using a fully dynamical OO mean that 5 this option is usually not feasible. This paper presents a method, based on canonical correlation analysis (CCA), that can be used to generate highly-efficient statistical OOs that are based on a dynamical model. These OOs can provide an approximation to the dynamical model at a fraction of the computational cost.One possible application of such an OO is the modelling of the diurnal cycle of sea surface temperature (SST) in ocean general circulation models (OGCMs). Satellites that measure SST measure the temperature of the thin uppermost layer of the 10 ocean. This layer is strongly affected by the atmospheric conditions and its temperature can differ significantly from the water below. This causes a discrepancy between the SST measurements and the upper layer of the OGCM, which typically has a thickness of around 1 m. The CCA OO method is used to parametrise the diurnal cycle of SST. The CCA OO is based on an input dataset from the General Ocean Turbulence Model (GOTM), a high-resolution water column model that has been specifically tuned for this purpose. The parameterisations of the CCA OO are found to be in good agreement with the results 15 from GOTM, showing the potential of this method for use in data assimilation systems.
IntroductionData assimilation (DA) strives to improve the forecast skill of a numerical model by combining the model with observations.Observations are incorporated into the model by applying a series of corrections to the internal state of the model. As the state variables of a numerical model are usually not observed directly, this procedure requires an observation operator (OO) to 20 project the model state variables onto the variable that is observed. Then, the difference between the observation and the model prediction, the so-called innovation, forms the basis for calculating the correction to the model state. The accuracy of the OO is paramount in this process: any bias in the projection will lead to a bias in the innovation and therefore result in a biased 1 Ocean Sci. Discuss., https://doi.Many different types of OO exist. In its simplest form, an OO could just select one of the state variables in a point near to the observation or, perhaps, perform an interpolation. More complex OOs may include corrections for processes that influence the observation, but are not or not sufficiently modelled. Ultimately one could even consider a dynamical OO that wraps a 5 second numerica...