The purpose of this research is to seek a better inversion algorithm. And on this basis, it explores the feasibility of using hyperspectral monitoring technology instead of laboratory physical and chemical index test and evaluates the prediction effect of inversion model on water quality change. So as to be more convenient, more economical and extensive monitoring methods for water quality monitoring of urban internal river are provided. This paper takes the water samples collected in Fuyang River in downtown Handan as the research object and obtains original spectral data of the samples by the ASD FieldSpec 4 field hyperspectral spectrometer. After the smoothing filter pretreatment by the Savitzky-Golay (SG) method and specified mathematical transformations, the modeling spectral indicators of various water quality parameters are selected and determined by calculating the maximum mean of absolute values for correlation coefficients of various spectral indicators and measured values in the wavelength range from 400 to 950nm. By introducing partial least squares (PLS), random forest (RF), and Lasso (least absolute shrinkage and selection operator), six water quality parameter fitting models were constructed including turbidity (Turb), suspended substance (SS), chemical oxygen demand (COD), NH4-N, total nitrogen (TN), and total phosphorus (TP), which are also testified and evaluated through hyperspectral data. The results show that different spectral transformation methods highlight different information inversion effects. The first derivative of reciprocal logarithm of spectral data after SG smoothing has a good modeling effect on four water quality parameters including Turb, COD, NH4-N, and TP; and the first derivative of smoothed spectral data has a good modeling effect on both water quality parameters of SS and TN. Among the three models, the PLS model has a good prediction effect, with the [Formula: see text] for COD, TN, and TP ranging from 0.74 to 0.80, while that for Turb and SS shows relatively poorer prediction effect, followed by even worse effect on HN4-H. Both machine learning algorithms of RF and Lasso have respectively obtained the best prediction models for different water quality parameters. The Lasso model has a [Formula: see text] value above 0.8 for water body organic pollutants COD, TN, and TP, and the decrease value for [Formula: see text] and [Formula: see text] is below 0.1, which indicates that the model has high prediction accuracy and strong generalization ability, but the results of SS and NH4-N do not meet the expected accuracy. In the inversion model of RF for COD, [Formula: see text] is higher than [Formula: see text], which shows excellent performance, and has certain prediction ability for SS and NH4-N. The RF model and Lasso model complement each other effectively in applicability and prediction accuracy. Compared with the traditional regression model PLS, machine learning has obvious overall advantages, making it more suitable for classified inversion prediction of urban river water quality parameters.
Read full abstract