Competing Forecast Verification: Using the Power-Divergence Statistic for Testing the Frequency of “Better”

Eric Gilleland, David D Turner, Domingo Muñoz-Esparza

doi:10.1175/waf-d-22-0201.1

Abstract

When testing hypotheses about which of two competing models is better, say A and B, the difference is often not significant. An alternative, complementary approach is to measure how often model A is better than model B, regardless of how slight or large the difference. The hypothesis concerns whether or not the percentage of time that model A is better than model B is larger than 50%. One generalized test statistic that can be used is the power-divergence test, which encompasses many familiar goodness-of-fit test statistics, such as the loglikelihood-ratio and Pearson χ² tests. Theoretical results justify using the χ² distribution with k − 1 degrees of freedom for the entire family of test statistics, where k is the number of categories. However, these results assume that the underlying data are independent and identically distributed, an assumption that is often violated. Empirical results demonstrate that the reduction to two categories (i.e., model A is better than model B versus model B is better than model A) results in a test that is reasonably robust to even severe departures from temporal independence, as well as to contemporaneous correlation. The test is demonstrated on two different example verification sets: 6-h forecasts of eddy dissipation rate (m^(2/3) s⁻¹) from two versions of the Graphical Turbulence Guidance model, and 12-h forecasts of 2-m temperature (°C) and 10-m wind speed (m s⁻¹) from two versions of the High-Resolution Rapid Refresh model. The novelty of this paper is in demonstrating the utility of the power-divergence statistic in the face of temporally dependent data, as well as the emphasis on testing for the “frequency-of-better” alongside more traditional measures.
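As a rough illustration of the two-category test described in the abstract, the sketch below computes a Cressie-Read power-divergence statistic for the counts "A better" versus "B better" against a 50/50 null. It is not the authors' implementation. The function names, the choice of absolute error as the measure of "better", the dropping of ties, the default λ = 2/3, and the restriction to λ ≥ 0 are all assumptions made for this example. The p-value uses the asymptotic χ² distribution with one degree of freedom (k − 1 with k = 2) and is two-sided, so the direction of any difference has to be read from the counts.

```python
import math


def count_better(errors_a, errors_b):
    """Count how often each model has the smaller absolute error.

    Hypothetical helper: 'better' is taken to mean smaller absolute
    error, and ties are dropped (both are assumptions of this sketch).
    """
    n_a = sum(1 for a, b in zip(errors_a, errors_b) if abs(a) < abs(b))
    n_b = sum(1 for a, b in zip(errors_a, errors_b) if abs(b) < abs(a))
    return n_a, n_b


def power_divergence_two_category(n_a_better, n_b_better, lam=2.0 / 3.0):
    """Power-divergence statistic and asymptotic p-value for a 50/50 null.

    lam = 1 gives Pearson's chi-square and lam = 0 gives the
    loglikelihood-ratio statistic. Only lam >= 0 is supported here.
    """
    if lam < 0:
        raise ValueError("this sketch only supports lam >= 0")
    observed = [n_a_better, n_b_better]
    n = sum(observed)
    if n == 0:
        raise ValueError("no untied cases to test")
    expected = n / 2.0  # equal frequency under the null hypothesis

    if lam == 0:
        # Limiting case: 2 * sum(O * ln(O / E)); zero counts contribute 0.
        stat = 2.0 * sum(o * math.log(o / expected) for o in observed if o > 0)
    else:
        stat = (2.0 / (lam * (lam + 1.0))) * sum(
            o * ((o / expected) ** lam - 1.0) for o in observed
        )
    stat = max(stat, 0.0)  # guard against tiny negative rounding error

    # Survival function of chi-square with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Illustrative counts only (not from the paper): A better 60 times, B 40.
stat, p_value = power_divergence_two_category(60, 40, lam=1.0)
print(f"statistic = {stat:.3f}, p-value = {p_value:.4f}")
```

With real verification data, the counts would come from paired forecast errors, for example via `count_better(errors_a, errors_b)`. The asymptotic p-value assumes independent cases. The abstract reports that the two-category test is reasonably robust to temporal dependence, but this sketch makes no adjustment for it.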

Similar Papers

Verification of the Global Forecast System, North American Mesoscale Forecast System, and High-Resolution Rapid Refresh Model Near-Surface Forecasts by Use of the New York State Mesonet
Lauriana C Gaudet ... Ryan D Torn
Weather and Forecasting | VOL. 39
01 Feb 2024

Evaluation of a cloudy cold-air pool in the Columbia River basin in different versions of the High-Resolution Rapid Refresh (HRRR) model
Bianca Adler ... Irina V Djalalova
Geoscientific Model Development | VOL. 16
26 Jan 2023

Increasing destructive potential of extratropical transition events in response to higher CO2 concentration in global climate model
Hung Ming Cheung ... Jung-Eun Chu
15 May 2023

Correlation-Based Inference for Linkage Disequilibrium With Multiple Alleles
Dmitri V Zaykin ... Bruce S Weir
Genetics | VOL. 180
01 Sep 2008
