Purpose The purpose of this study is to propose a systematic model selection procedure for long-term load forecasting (LTLF) for ex-ante and ex-post cases considering uncertainty in exogenous predictors. Design/methodology/approach The different variants of regression models, namely, Polynomial Regression (PR), Generalised Additive Model (GAM), Quantile Polynomial Regression (QPR) and Quantile Spline Regression (QSR), incorporating uncertainty in exogenous predictors like population, Real Gross State Product (RGSP) and Real Per Capita Income (RPCI), temperature and indicators of breakpoints and calendar effects, are considered for LTLF. Initially, the Backward Feature Elimination procedure is used to identify the optimal set of predictors for LTLF. Then, the consistency in model accuracies is evaluated using point and probabilistic forecast error metrics for ex-ante and ex-post cases. Findings From this study, it is found PR model outperformed in ex-ante condition, while QPR model outperformed in ex-post condition. Further, QPR model performed consistently across validation and testing periods. Overall, QPR model excelled in capturing uncertainty in exogenous predictors, thereby reducing over-forecast error and risk of overinvestment. Research limitations/implications These findings can help utilities to align model selection strategies with their risk tolerance. Originality/value To propose the systematic model selection procedure in this study, the consistent performance of PR, GAM, QPR and QSR models are evaluated using point forecast accuracy metrics Mean Absolute Percentage Error, Root Mean Squared Error and probabilistic forecast accuracy metric Pinball Score for ex-ante and ex-post cases considering uncertainty in the considered exogenous predictors such as RGSP, RPCI, population and temperature.
Read full abstract