2007
DOI: 10.1175/waf966.1
|View full text |Cite
|
Sign up to set email alerts
|

Scoring Probabilistic Forecasts: The Importance of Being Proper

Abstract: Questions remain regarding how the skill of operational probabilistic forecasts is most usefully evaluated or compared, even though probability forecasts have been a long-standing aim in meteorological forecasting. This paper explains the importance of employing proper scores when selecting between the various measures of forecast skill. It is demonstrated that only proper scores provide internally consistent evaluations of probability forecasts, justifying the focus on proper scores independent of any attempt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
162
0

Year Published

2007
2007
2022
2022

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 163 publications
(162 citation statements)
references
References 12 publications
0
162
0
Order By: Relevance
“…If the forecaster is concerned to achieve the best possible score, then it may be in his/her interests to issue a forecast that is inconsistent with his/her beliefs about what is likely to happen. A strictly proper score is one for which the forecaster uniquely optimizes the expected score by forecasting his/her true beliefs (Bröcker and Smith, 2007;Jolliffe, 2008). In most situations a strictly proper score would be desirable so that the forecaster does not issue a misleading forecast.…”
Section: Proprietymentioning
confidence: 99%
“…If the forecaster is concerned to achieve the best possible score, then it may be in his/her interests to issue a forecast that is inconsistent with his/her beliefs about what is likely to happen. A strictly proper score is one for which the forecaster uniquely optimizes the expected score by forecasting his/her true beliefs (Bröcker and Smith, 2007;Jolliffe, 2008). In most situations a strictly proper score would be desirable so that the forecaster does not issue a misleading forecast.…”
Section: Proprietymentioning
confidence: 99%
“…Several different attributes of the forecast ensemble are often quantified, including model bias and ensemble spread, as well as the correspondence between forecast and outcome pairs using a variety of statistical metrics (Jolliffe and Stephenson 2003). Such metrics include deterministic skill scores, which consider the ensemble mean properties of a set of predictions, and probabilistic skill scores (Bröcker and Smith 2007) that quantify the quality of the full distribution of ensemble members relative to a reference forecast system (such as climatology, persistence or another forecast system) (Suckling and Smith 2013). Reliability measures the correspondence between the predicted probabilities and observed frequencies of a particular set of events.…”
Section: Assessing the Quality Of Climate Forecastsmentioning
confidence: 99%
“…Bröcker and Smith (2006) show that only proper scores provide internally consistent evaluations of probability forecasts. The BS is not the only proper scoring rule (Gneiting and Raftery, 2004;Bröcker and 186 M. S. ROULSTON Smith, 2006), but it is probably the most widely used in the verification of operational probability forecasts. An example of an improper scoring rule is the naive linear scoring rule defined by 2 LS = (1 − f ) if the event occurs f if the event does not occur (6)…”
Section: Introductionmentioning
confidence: 95%
“…The BS discourages hedging and this is generally regarded as a highly desirable trait for a forecaster performance metric (Winkler and Murphy, 1969;Lindley, 1985;Murphy and Winkler, 1987;Wilks, 1995;Potts, 2002;Gneiting and Raftery, 2004). Bröcker and Smith (2006) show that only proper scores provide internally consistent evaluations of probability forecasts. The BS is not the only proper scoring rule (Gneiting and Raftery, 2004;Bröcker and 186 M. S. ROULSTON Smith, 2006), but it is probably the most widely used in the verification of operational probability forecasts.…”
Section: Introductionmentioning
confidence: 99%