This study utilizes ultraviolet and fluorescence spectroscopic indices of dissolved organic matter (DOM) from sediments, combined with machine learning (ML) models, to develop an optimized predictive model for estimating sediment total organic carbon (TOC) and identifying adjacent land-use types in coastal sediments from the Yellow and Bohai Seas. Our results indicate that ML models surpass traditional regression techniques in estimating TOC and classifying land-use types. Penalized Least Squares Regression (PLR) and Cubist models show exceptional TOC estimation capabilities, with PLR exhibiting the lowest training error and Cubist achieving a correlation coefficient 0.79. In land-use classification, Support Vector Machines achieved 85.6 % accuracy in training and 92.2 % in testing. Maximum fluorescence intensity and ultraviolet absorbance at 254 nm were crucial factors influencing TOC variations in coastal sediments. This study underscores the efficacy of ML models utilizing DOM optical indices for near real-time estimation of marine sediment TOC and land-use classification.
Read full abstract