Accurate and reliable forecasting of monthly runoff considering several years of rehabilitation helps in planning and managing the water resources system of bauxite mining areas. A combination of linear regression models and artificial intelligence was proposed and applied to predict runoff from mined areas. The ideal number of input variables was chosen considering a correction threshold (correlation above 95%), stepwise selection, and according to the recursive feature elimination (RFE). The linear regression equations were divided into explanatory and predictive, with the predictive ones being trained and validated with all possible combinations of annual data periods. Five machine learning models (linear model (LM), K-nearest neighbor regression (KNN), support vector machine (SVM), random forest (RF), and Cubist) were trained and used as a prediction instrument on the test set of each subsequent year. Precipitation, accumulated precipitation duration, maximum precipitation intensity, and number of events were the most explanatory variables in most linear regression and machine learning equations. The linear regression model approach presented satisfactory performance indices in predicting surface runoff in a mined bauxite area. The Cubist and RF models were the best predictors for the greatest number of years of rehabilitation, with a coefficient of determination varying with maximum values between 0.52 and 0.88 in validation. There was no best model that consistently presented the best result in all years of rehabilitation, and it is recommended to use the annual model for the respective year of rehabilitation. The linear regression equations and algorithms were useful tools in predicting runoff series and, therefore, promising monthly runoff prediction. The presented approach becomes a starting point in the search to optimize the hydrological planning of surface bauxite mines.
Read full abstract