Accurate prediction of daily river flow (Qt) remains a challenging yet essential task in hydrological modeling, particularly crucial for flood mitigation and water resource management. This study introduces an advanced M5 Prime (M5P) predictive model designed to estimate Qt as well as one- and two-day-ahead river flow forecasts (i.e. Qt+1 and Qt+2). The predictive performance of M5P ensembles incorporating Bootstrap Aggregation (BA), Disjoint Aggregating (DA), Additive Regression (AR), Vote (V), Iterative classifier optimizer (ICO), Random Subspace (RS), and Rotation Forest (ROF) were comprehensively evaluated. The proposed models were applied to a case study data in Tuolumne County, US, using a dataset comprising measured precipitation (Pt), evaporation (Et), and Qt. A wide range of input scenarios were explored for predicting Qt, Qt+1, and Qt+2. Results indicate that Pt and Qt significantly influence prediction accuracy. Notably, relying solely on the most correlated variable (e.g., Qt-1) does not guarantee robust prediction of Qt. However, extending the forecast horizon mitigates the influence of low-correlation input variables on model accuracy. Performance metrics indicate that the DA-M5P model achieves superior results, with Nash-Sutcliff Efficiency of 0.916 and root mean square error of 23 m3/s, followed by ROF-M5P, BA-M5P, AR-M5P, AR-M5P, RS-M5P, V-M5P, ICO-M5P, and the standalone M5P model. The ensemble M5P modeling framework enhanced the predictive capability of the stand-alone M5P algorithm by 1.2 %–22.6 %, underscoring its efficacy and potential for advancing hydrological forecasting.
Read full abstract