Highly accurate streamflow simulations are essential for water resources and river management. However, there are longstanding challenges for streamflow modeling in the hydrology field. As a result, deep learning methods have been introduced as novel tools and are being used in simulation. This paper proposes a novel approach that utilizes unconventional data preprocessing techniques to improve the accuracy of daily streamflow predictions in flood-prone regions. We applied this approach to a flood-prone area in western Iran and simulated daily streamflow using a deep learning Gated Recurrent Unit (GRU) method under various data selection scenarios. These scenarios focused on identifying the optimal combination of input variables, time steps, and outlier removal techniques. The outlier removal methods investigated in this study include Mahalanobis distance, critical section removal, Z-score, and no removal. The average rainfall of the area, data driven precipitation, Normalized Difference Vegetation Index (NDVI), surface soil moisture, groundwater baseflows, and streamflow in the hydrometric station were evaluated using correlation control method, and inputs with the lowest correlation were removed. Based on the results obtained from the deep learning models produced in the research, it was found that the GRU model, with various modified inputs using Z-score removal, had the best performance. The model had an average of Root Mean Squared Error (RMSE) at 5.24 mm and R2 (Coefficient of Determination) at 0.91 during training, while during validation, it had an RMSE of 7.74 mm and R2 of 0.83. Considering using relatively low convergence data in streamflow simulation in this study, it can be said that the listed scenarios showed an appropriate result in dealing with the data and recognizing the complex pattern of the daily streamflow, and the future studies will show the improvement of the GRU models if higher convergence data will be used.
Read full abstract