Estimation Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked Probability Score versus Maximum Likelihood

Manuel Gebetsberger,Georg J Mayr,Jakob W Messner,Achim Zeileis

doi:10.1175/mwr-d-17-0364.1

Manuel Gebetsberger, Georg J Mayr + Show 2 more

Open Access

https://doi.org/10.1175/mwr-d-17-0364.1

Copy DOI

Journal: Monthly Weather Review	Publication Date: Dec 1, 2018
Citations: 36	License type: CC BY 4.0

Affiliation: Universität Innsbruck

Abstract

Abstract Nonhomogeneous regression models are widely used to statistically postprocess numerical ensemble weather prediction models. Such regression models are capable of forecasting full probability distributions and correcting for ensemble errors in the mean and variance. To estimate the corresponding regression coefficients, minimization of the continuous ranked probability score (CRPS) has widely been used in meteorological postprocessing studies and has often been found to yield more calibrated forecasts compared to maximum likelihood estimation. From a theoretical perspective, both estimators are consistent and should lead to similar results, provided the correct distribution assumption about empirical data. Differences between the estimated values indicate a wrong specification of the regression model. This study compares the two estimators for probabilistic temperature forecasting with nonhomogeneous regression, where results show discrepancies for the classical Gaussian assumption. The heavy-tailed logistic and Student’s t distributions can improve forecast performance in terms of sharpness and calibration, and lead to only minor differences between the estimators employed. Finally, a simulation study confirms the importance of appropriate distribution assumptions and shows that for a correctly specified model the maximum likelihood estimator is slightly more efficient than the CRPS estimator.

Highlights

Each trained model is applied on raw ensemble data of the remaining test subsample. This leads to out-of-sample forecasts not used for training, which are verified with PIW, prediction interval coverage (PIC), reliability index (RI), log score (LS), and continuous ranked probability score (CRPS)
Nonhomogeneous regression is a commonly used postprocessing strategy to statistically correct numerical weather prediction models (NWP) ensemble forecasts. This approach predicts the outcome of weather quantities of interest with full parametric forecast distributions
Log-score (LS) minimization has a long tradition in statistical modeling, whereas CRPS minimization has become popular in meteorological studies

Summary

Introduction

Nonhomogeneous regression is a popular regressionbased technique to statistically correct an ensemble of numerical weather prediction models (NWP; Leith 1974). For perfect calibration the probability integral transform (PIT) should be distributed uniformly Both estimation approaches, LS and CRPS minimization, show a hump in the center bins indicating overdispersive forecasts (i.e., the forecast distribution is too wide so that observations fall overproportionally into the central range of the distribution). Both estimation approaches show only small differences and much better calibration. In this article we set out to investigate when and why results from LS and CRPS minimization will differ for symmetric distribution assumptions This is performed in terms of temperature forecasting in central Europe and with simulated data using the NGR as the benchmark approach.

Methods

Probabilistic temperature forecasting

Simulation study

Findings

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Estimation Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked Probability Score versus Maximum Likelihood

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Monthly Weather Review

Lead the way for us

Similar Papers

Statistical Methods to Study Timing of Vulnerability with Sparsely Sampled Data on Environmental Toxicants
Brisa Ney Sánchez ... Howard Hu
Environmental Health Perspectives | VOL. 119
Brisa Ney Sánchez, et. al.Brisa Ney Sánchez ... Howard Hu
08 Dec 2010
Environmental Health Perspectives | VOL. 119

Confirmatory Factor Analyses in Psychological Test Adaptation and Development
Kay Brauer ... Matthias Ziegler
Psychological Test Adaptation and Development | VOL. 4
Kay Brauer, et. al.Kay Brauer ... Matthias Ziegler
01 Feb 2023
Psychological Test Adaptation and Development | VOL. 4

Maximum Likelihood and Semiparametric Estimation in Logistic Models with Incomplete Covariate Data

-

01 Jan 1998
01 Jan 1998

Added-value of ensemble prediction system on the quality of solar irradiance probabilistic forecasts
Josselin Le Gal La Salle ... Mathieu David
Renewable Energy | VOL. 162
Josselin Le Gal La Salle, et. al.Josselin Le Gal La Salle ... Mathieu David
22 Aug 2020
Renewable Energy | VOL. 162

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Estimation Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked Probability Score versus Maximum Likelihood

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Monthly Weather Review