Abstract

The skill of a forecast can be assessed by comparing the relative proximity of both the forecast and a benchmark to the observations. Example benchmarks include climatology or a naive forecast. Hydrological ensemble prediction systems (HEPS) are currently transforming the hydrological forecasting environment, but in this new field there is little information to guide researchers and operational forecasters on how benchmarks can best be used to evaluate their probabilistic forecasts. In this study it is shown that the calculated forecast skill can vary depending on the benchmark selected, and that the choice of benchmark for determining forecasting system skill is sensitive to a number of hydrological and system factors. A benchmark intercomparison experiment is then undertaken using the continuous ranked probability score (CRPS), a reference forecasting system and a suite of 23 different methods to derive benchmarks. The benchmarks are assessed within the operational set-up of the European Flood Awareness System (EFAS) to determine those that are ‘toughest to beat’ and so give the most robust discrimination of forecast skill, particularly for the spatial average fields that EFAS relies upon. Evaluating against an observed discharge proxy, the benchmark that has most utility for EFAS and best avoids naive skill across different hydrological situations is found to be meteorological persistency. This benchmark uses the latest meteorological observations of precipitation and temperature to drive the hydrological model. Hydrological long-term average benchmarks, which are currently used in EFAS, are very easily beaten by the forecasting system, and their use produces considerable naive skill. When decomposed into seasons, the advanced meteorological benchmarks, which make use of meteorological observations from the past 20 years at the same calendar date, have the most skill discrimination.
They are also good at discriminating skill in low flows and for all catchment sizes. Simpler meteorological benchmarks are particularly useful for high flows. Recommendations for EFAS are to move to routine use of meteorological persistency, an advanced meteorological benchmark and a simple meteorological benchmark in order to provide a robust evaluation of forecast skill. This work provides the first comprehensive evidence on how benchmarks can be used in the evaluation of skill in probabilistic hydrological forecasts, and on which benchmarks are most useful for skill discrimination and avoidance of naive skill in a large-scale HEPS. It is recommended that all HEPS use the evidence and methodology provided here to evaluate which benchmarks to employ, so that forecasters can have trust in their skill evaluation and confidence that their forecasts are indeed better than the benchmark.
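The CRPS used in the experiment rewards ensembles that are both close to the observation and appropriately sharp. As a minimal sketch (not the EFAS implementation), the empirical CRPS of an ensemble can be computed from its mean absolute error to the observation minus half the mean absolute difference between members; the function name and sample values below are illustrative only.

```python
def crps_ensemble(members, obs):
    """Empirical CRPS of an ensemble forecast against a single observation.

    CRPS = mean(|x_i - y|) - 0.5 * mean(|x_i - x_j|)
    where x_i are ensemble members and y is the observation.
    Lower values indicate a better forecast.
    """
    m = len(members)
    # Accuracy term: mean distance of members from the observation.
    accuracy = sum(abs(x - obs) for x in members) / m
    # Spread term: mean pairwise distance between members (halved).
    spread = sum(abs(a - b) for a in members for b in members) / (2 * m * m)
    return accuracy - spread


# Illustrative example: a two-member ensemble bracketing the observation.
print(crps_ensemble([0.0, 2.0], 1.0))  # 0.5
```

A deterministic benchmark (such as persistency) is the special case of a one-member ensemble, for which the CRPS reduces to the absolute error.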

Highlights

  • River flow forecasts are used to make decisions on upcoming floods and low flows/droughts by hydro-meteorological agencies around the world (Pagano et al., 2013; Wetterhall et al., 2013)

  • This paper focuses on the use of benchmarks in the evaluation of skill of ensemble or probabilistic hydrological forecasts made by hydrological ensemble prediction systems (HEPS)

  • This paper considers how best to use benchmarks for forecast evaluation in HEPS, which is useful for automatic quality checking of large scale forecasts and for when forecasting system upgrades are made


Introduction

River flow forecasts are used to make decisions on upcoming floods and low flows/droughts by hydro-meteorological agencies around the world (Pagano et al., 2013; Wetterhall et al., 2013). The forecasts from these operational systems are evaluated in terms of the degree of similarity between some verification data, such as observations of river discharge, and the forecast (Demargne et al., 2009). Another important component of the forecast evaluation is whether the forecasts add value or have skill compared to climatology or another simple ‘best guess’ (Luo et al., 2012; Perrin et al., 2006; Fewtrell et al., 2011). Such a comparison is particularly important when evaluating the gain in performance from additional procedures or new developments introduced into the forecasting system, such as data assimilation or post-processing techniques.
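The "added value" question is typically answered with a skill score: the fractional improvement of the forecasting system's score over the benchmark's score. A minimal sketch of the CRPS-based skill score (CRPSS) is given below; the function names and sample values are illustrative assumptions, not the paper's code.

```python
def crps(members, obs):
    """Empirical CRPS of an ensemble against one observation (lower is better)."""
    m = len(members)
    spread = sum(abs(a - b) for a in members for b in members) / (2 * m * m)
    return sum(abs(x - obs) for x in members) / m - spread


def crpss(forecast, benchmark, obs):
    """CRPS skill score of a forecast relative to a benchmark ensemble.

    CRPSS = 1 - CRPS_forecast / CRPS_benchmark
    Positive values mean the forecast beats the benchmark; zero means no
    added value; negative values mean the benchmark is better.
    """
    return 1.0 - crps(forecast, obs) / crps(benchmark, obs)


# Illustrative example: a sharp forecast vs. a wide climatology-like ensemble.
print(crpss([9.0, 11.0], [0.0, 20.0], 10.0))  # 0.9
```

The choice of benchmark matters because an easily beaten benchmark (e.g. a wide climatology) inflates the skill score, which is precisely the "naive skill" problem the study addresses.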

