Probabilistic Forecasts: Scoring Rules and Their Decomposition and Diagrammatic Representation via Bregman Divergences

Hughes, G.; Topp, Cairistiona F. E.

doi:10.3390/e17085450

Cited by 7 publications

(4 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Water Resources Research Hughes and Topp (2015) to provide a diagrammatic interpretation of the Brier scoring rule and associated score divergences.…”

Section: D3 Pseudospherical Scorementioning

confidence: 99%

“…TableE1is taken fromHughes and Topp (2015) and summarizes a data set of n = 346 forecasts of 24-hr precipitation probability made by the Finnish Meteorological Institute during 2003 for the city of Tampere in southcentral Finland. The left block presents the original data, and the right block lists the data used in our case study.Forecast probabilities of rainfall p k ; k = (1, …, m) were issued using m = 11 categories.…”

mentioning

confidence: 99%

“…Finally, n k /n corresponds to the relative frequency of each forecast category. We refer readers toHughes and Topp (2015) for a more detailed description of the data set. The raw precipitation data can be found at https://www.cawcr.gov.au/projects/…”

mentioning

confidence: 99%

See 2 more Smart Citations

Distribution‐Based Model Evaluation and Diagnostics: Elicitability, Propriety, and Scoring Rules for Hydrograph Functionals

Vrugt

2024

Water Resources Research

View full text Add to dashboard Cite

Distribution forecasts P over future quantities or events are routinely made in hydrology but usually traded for a (likelihood‐weighted) mean or median prediction to accommodate error measures or scoring functions such as the mean absolute error or mean squared error. Case in point is the so‐called KG efficiency (KGE) of Gupta et al. (2009, https://doi.org/10.1016/j.jhydrol.2009.08.003) and improvements thereof (Lamontagne et al., 2020, https://doi.org/10.1029/2020wr027101), which have rapidly gained popularity among hydrologists as alternative scoring functions to the commonly used Nash and Sutcliffe (1970, https://doi.org/10.1016/0022‐1694(70)90255‐6) efficiency, but are equally exclusive in how they quantify model performance using only single‐valued output of the quantities of interest. This point‐valued mapping necessarily implies a loss of information about model performance. This paper advocates the use of probabilistic watershed model training, evaluation and diagnostics. Distribution evaluation opens a mature literature on scoring rules whose strong statistical underpinning provides, as we will demonstrate, the theory, context and guidelines necessary for the development of robust information‐theoretically principled metrics for watershed signatures. These so‐called hydrograph functionals are scalar‐valued mappings of major behavioral watershed functions embodied in a strictly proper scoring rule. We discuss past developments that led to the current state‐of‐the‐art of distribution evaluation in hydrology and review scoring rules for dichotomous and categorical events, quantiles (intervals) and density forecasts. We are particularly concerned with elicitable functionals and scoring rule propriety, discuss the decomposition of scoring rules into a sharpness, reliability and entropy term and present diagnostically appealing strictly proper divergence scores of hydrograph functionals for flood frequency analysis, flow duration and recession curves. The usefulness and power of distribution‐based model evaluation and diagnostics by means of scoring rules is demonstrated on simple illustrative problems and discharge distributions simulated with watershed models using random sampling and Bayesian model averaging. The presented theory (a) enables a more complete evaluation of distribution forecasts, (b) offers a statistically principled means for watershed model training, evaluation, diagnostics and selection using hydrograph functionals and/or extreme events and (c) provides a universal framework for metric development of watershed signatures, promoting metric standardization and reproducibility.

show abstract

“…Water Resources Research Hughes and Topp (2015) to provide a diagrammatic interpretation of the Brier scoring rule and associated score divergences.…”

Section: D3 Pseudospherical Scorementioning

confidence: 99%

mentioning

confidence: 99%

See 1 more Smart Citation

Distribution‐Based Model Evaluation and Diagnostics: Elicitability, Propriety, and Scoring Rules for Hydrograph Functionals

Vrugt

2024

Water Resources Research

View full text Add to dashboard Cite

show abstract

“…Logarithmic scoring rule is also used to elicit the agent's beliefs in terms of subjective probabilities. However, logarithmic scoring rule attaches larger penalties than the quadratic scoring rule [28]. The logarithmic scoring rule deducts for inaccuracy by adding the natural log of the occurred event's probability from the base score [26].…”

Section: ) Logarithmic Scoring Rulementioning

confidence: 99%

Eliciting Truthful Data From Crowdsourced Wireless Monitoring Modules in Cloud Managed Networks

et al. 2020

View full text Add to dashboard Cite

To facilitate efficient cloud managed resource allocation solutions, collection of key wireless metrics from multiple access points (APs) at different locations within a given area is required. In unlicensed shared spectrum bands collection of metric data can be a challenging task for a cloud manager as independent self-interested APs can operate in these bands in the same area. We propose to design an intelligent crowdsourcing solution that incentivizes independent APs to truthfully measure/report data relating to their wireless channel utilization (CU). Our work focuses on challenging scenarios where independent APs can take advantage of recurring patterns in CU data by utilizing distribution aware strategies to obtain higher reward payments. We design truthful reporting methods that utilize logarithmic and quadratic scoring rules for reward payments to the APs. We show that when measurement computation costs are considered then under certain scenarios these scoring rules no longer ensure incentive compatibility. To address this, we present a novel reward function which incorporates a distribution aware penalty cost that charges APs for distorting reports based on recurring patterns. Along with synthetic data, we also use real CU data values crowdsourced using multiple independent measuring/reporting devices deployed by us in the University of Oulu.

show abstract

Seasonal climate predictions for marine risk assessment in the Barents Sea

et al. 2022

View full text Add to dashboard Cite

Probabilistic Forecasts: Scoring Rules and Their Decomposition and Diagrammatic Representation via Bregman Divergences

Cited by 7 publications

References 26 publications

Distribution‐Based Model Evaluation and Diagnostics: Elicitability, Propriety, and Scoring Rules for Hydrograph Functionals

Distribution‐Based Model Evaluation and Diagnostics: Elicitability, Propriety, and Scoring Rules for Hydrograph Functionals

Eliciting Truthful Data From Crowdsourced Wireless Monitoring Modules in Cloud Managed Networks

Seasonal climate predictions for marine risk assessment in the Barents Sea

Contact Info

Product

Resources

About