The joint influence of break and noise variance on the break detection capability in time series homogenization

Lindau, Ralf; Venema, Victor

doi:10.5194/ascmo-4-1-2018

Cited by 24 publications

(38 citation statements)

References 26 publications

(38 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Using very high time frequency data series increases, for example, the noise of time series. Lindau and Venema (2018) showed that for a pairwise multiple breakpoint algorithm, the results for low signal‐to‐noise ratios (SNRs) do not differ much from random segmentations and that reliable break detection at low but realistic SNRs needs a new approach. However, a break identified by on the methods assessed here can be adjusted for in the individual observation series, and these homogenized individual data points can then be used for weather and climate extreme applications and assimilation into reanalysis products.…”

Section: Discussionmentioning

confidence: 99%

Homogenizing GPS Integrated Water Vapor Time Series: Benchmarking Break Detection Methods on Synthetic Data Sets

Malderen

Pottiaux

Kłos

et al. 2020

Earth and Space Science

View full text Add to dashboard Cite

We assess the performance of different break detection methods on three sets of benchmark data sets, each consisting of 120 daily time series of integrated water vapor differences. These differences are generated from the Global Positioning System (GPS) measurements at 120 sites worldwide, and the numerical weather prediction reanalysis (ERA-Interim) integrated water vapor output, which serves as the reference series here. The benchmark includes homogeneous and inhomogeneous sections with added nonclimatic shifts (breaks) in the latter. Three different variants of the benchmark time series are produced, with increasing complexity, by adding autoregressive noise of the first order to the white noise model and the periodic behavior and consecutively by adding gaps and allowing nonclimatic trends. The purpose of this "complex experiment" is to examine the performance of break detection methods in a more realistic case when the reference series are not homogeneous. We evaluate the performance of break detection methods with skill scores, centered root mean square errors (CRMSE), and trend differences relative to the trends of the homogeneous series. We found that most methods underestimate the number of breaks and have a significant number of false detections. Despite this, the degree of CRMSE reduction is significant (roughly between 40% and 80%) in the easy to moderate experiments, with the ratio of trend bias reduction is even exceeding the 90% of the raw data error. For the complex experiment, the improvement ranges between 15% and 35% with respect to the raw data, both in terms of RMSE and trend estimations.

show abstract

Section: Discussionmentioning

confidence: 99%

Homogenizing GPS Integrated Water Vapor Time Series: Benchmarking Break Detection Methods on Synthetic Data Sets

Malderen

Pottiaux

Kłos

et al. 2020

Earth and Space Science

View full text Add to dashboard Cite

show abstract

“…Thus, to decide which number of breaks is optimal a stop criterion is needed that penalizes the insertion of breaks. Caussinus and Mestre (2004), Domonkos (2011a) and Lindau and Venema (2018a) use the Lyazrhi stop criterion (Caussinus and Lyazrhi, 1997), which is given by…”

Section: Fraction Of Undetected Breaksmentioning

confidence: 99%

“…For the ideal case without noise, Lindau and Venema (2018a) showed that the explained variance grows with k by…”

Section: Fraction Of Undetected Breaksmentioning

confidence: 99%

A new method to study inhomogeneities in climate records: Brownian motion or random deviations?

Lindau

Venema

2019

Intl Journal of Climatology

Self Cite

View full text Add to dashboard Cite

Climate data are affected by inhomogeneities due to historical changes in the way the measurements were performed. Understanding these inhomogeneities is important for accurate estimates of long‐term changes in the climate. These inhomogeneities are typically characterized by the number of breaks and the size of the jumps or the variance of the break signal, but a full characterization of the break signal also includes its temporal behaviour. This study develops a method to distinguish between two types of breaks: random deviations from a baseline and Brownian motion. Strength and frequency of both break types are estimated by using the variance of the spatiotemporal differences in the time series of two nearby stations as input. Thus, the result is directly obtained from the data without running a homogenization algorithm to estimate the break signal from the data. This opens the possibility to determine the total number of breaks and not only that of the significantly large ones. The application to German temperature observations suggests generally small inhomogeneities dominated by random deviations from a baseline. U.S. stations, on the other hand, also show the characteristics of a strong Brownian‐motion‐type component.

show abstract

“…This suggests that stairs are easier to detect than platforms, although also gradual inhomogeneities modelled as linear trends in the station data could have lowered the percentage of detected platforms. The percentage of stairs and platforms as they are detectable by a homogenization algorithm will be further dependent, e.g., on the signal-to-noise ratio (SNR), which is shown to be a key parameter for break detection (Lindau and Venema, 2016;Lindau and Venema, 2018a). Thus, to conclude that the reduced number of detected platforms is directly caused by an admixture of BM-type breaks is risky and has been a motivation for this further study.…”

Section: The Different Characteristics Of Brownian Motion and Random mentioning

confidence: 99%

A new method to study inhomogeneities in climate records: Brownian Motion or Random Deviations?

Lindau¹,

Venema²

2019

Preprint

Self Cite

View full text Add to dashboard Cite

Climate data is affected by inhomogeneities due to historical changes in the way the measurements were performed. Understanding these inhomogeneities is important for accurate estimates of long-term changes in the climate. These inhomogeneities are typically characterized by the number of breaks and the size of the jumps or the variance of the break signal, but a full characterization of the break signal also includes its temporal behavior. This study develops a method to distinguish between two types of breaks: random deviations from a baseline and Brownian motion. Strength and frequency of both break types are estimated by using the variance of the spatiotemporal differences in the time series of two nearby stations as input. Thus, the result is directly obtained from the data without running a homogenization algorithm to estimate the break signal from the data. This opens the possibility to determine the total number of breaks and not only that of the significantly large ones. The application to German temperature observations suggests generally small inhomogeneities dominated by random deviations from a baseline. US stations, on the other hand, show also the characteristics of a strong Brownian motion type component.

show abstract

The joint influence of break and noise variance on the break detection capability in time series homogenization

Cited by 24 publications

References 26 publications

Homogenizing GPS Integrated Water Vapor Time Series: Benchmarking Break Detection Methods on Synthetic Data Sets

Homogenizing GPS Integrated Water Vapor Time Series: Benchmarking Break Detection Methods on Synthetic Data Sets

A new method to study inhomogeneities in climate records: Brownian motion or random deviations?

A new method to study inhomogeneities in climate records: Brownian Motion or Random Deviations?

Contact Info

Product

Resources

About