Abstract

AbstractThe philosophy of forecast verification is rather different between deterministic and probabilistic verification metrics: generally speaking, deterministic metrics measure differences, whereas probabilistic metrics assess reliability and sharpness of predictive distributions. This article considers the root‐mean‐square error (RMSE), which can be seen as a deterministic metric, and the probabilistic metric Continuous Ranked Probability Score (CRPS), and demonstrates that under certain conditions, the CRPS can be mathematically expressed in terms of the RMSE when these metrics are aggregated. One of the required conditions is the normality of distributions. The other condition is that, while the forecast ensemble need not be calibrated, any bias or over/underdispersion cannot depend on the forecast distribution itself. Under these conditions, the CRPS is a fraction of the RMSE, and this fraction depends only on the heteroscedasticity of the ensemble spread and the measures of calibration. The derived CRPS–RMSE relationship for the case of perfect ensemble reliability is tested on simulations of idealised two‐dimensional barotropic turbulence. Results suggest that the relationship holds approximately despite the normality condition not being met.

Highlights

  • Operational numerical weather prediction (NWP) centres use a range of metrics to monitor and communicate forecast performance and make decisions about model upgrades

  • We have derived a functional relationship between two forecast verification metrics: the Continuous Ranked Probability Score (CRPS) and the root-mean-square error (RMSE) (Sections 2 and 3)

  • The RMSE is the sum of the ensemble variance and the squared error of the ensemble mean

Read more

Summary

INTRODUCTION

Operational numerical weather prediction (NWP) centres use a range of metrics to monitor and communicate forecast performance and make decisions about model upgrades These metrics, which summarise information contained in forecasts and verifying analyses and convert them into scalar values, can broadly be divided into two categories. In this article we shall demonstrate further that, in a bulk sense and under certain conditions, the CRPS is a function of the RMSE of the ensemble members This RMSE can be related to the squared difference between the ensemble mean and the verifying analysis, which is in itself a deterministic verification metric. In this way, the CRPS–RMSE relationship may draw a link between deterministic and probabilistic verification.

Preliminaries
Continuous Ranked Probability Score
Root-mean-square error
DERIVATION O F THE CRPS–RMSE RELATIONSHIP
THE CRPS–RMSE RELATIONSHIP IN AN IDEALISED 2D TURBULENCE MODEL
Experimental design
Results
Non-normality of the ensemble distribution
DISCUSSION AND SUMMARY
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call