Abstract

Sediment load in fluvial systems is one of the critical factors shaping the river geomorphological and hydraulic characteristics. A detailed understanding of the total sediment load (TSL) is required for the protection of physical, environmental, and ecological functions of rivers. This study develops a robust methodological approach based on multiple linear regression (MLR) and support vector regression (SVR) models modified by principal component analysis (PCA) to predict the TSL in rivers. A database of sediment measurement from large-scale physical modelling tests with 4759 datapoints were used to develop the predictive model. A dimensional analysis was performed based on the literature, and ten dimensionless parameters were identified as the key drivers of the TSL in rivers. These drivers were converted to uncorrelated principal components to feed the MLR and SVR models (PCA-based MLR and PCA-based SVR models) developed within this study. A stepwise PCA-based MLR and a 10-fold PCA-based SVR model with different kernel-type functions were tuned to derive an accurate TSL predictive model. Our findings suggest that the PCA-based SVR model with the kernel-type radial basis function has the best predictive performance in terms of statistical error measures including the root-mean-square error normalized with the standard deviation (RMSE/StD) and the Nash–Sutcliffe coefficient of efficiency (NSE), for the estimation of the TSL in rivers. The PCA-based MLR and PCA-based SVR models, with an overall RMSE/StD of 0.45 and 0.35, respectively, outperform the existing well-established empirical formulae for TSL estimation. The analysis of the results confirms the robustness of the proposed PCA-based SVR model for prediction of the cases with high concentration of sediments (NSE = 0.68), where the existing sediment estimation models usually have poor performance.

Highlights

  • Natural and anthropogenically driven forces during the Anthropocene are threatening the sustainable function of rivers by introducing large sediment loads to the freshwater ecosystems and fluvial systems

  • We examined the performance of the developed principal component analysis (PCA)-based support vector regression (SVR) model with different kernel-type functions to propose an accurate and robust total sediment load (TSL) estimation model for rivers based on the ten uncorrelated principal components (PCs) obtained from the conversion of the highly influential drivers given in Equation (3)

  • We use the Nash–Sutcliffe coefficient of efficiency (NSE) and root-mean-square error (RMSE)/standard deviation (StD) to evaluate the performance of the developed PCA-based multiple linear regression (MLR) and PCA-based SVR models

Read more

Summary

Introduction

Natural and anthropogenically driven forces during the Anthropocene are threatening the sustainable function of rivers by introducing large sediment loads to the freshwater ecosystems and fluvial systems. Due to the difficulty in measuring some of the effective parameters that influence the transport of sediments in rivers (hereafter referred to as “drivers”), the available empirical-based equations and data driven models for the estimation of TSL only rely on few measurable variables in rivers. A comprehensive review of the drivers of the TSL is necessary to derive more robust formulae for the estimation of the TSL for a range of hydro-environmental and geomorphological conditions Another challenge associated with the existing empirical-based models is their applicability for extreme TSL conditions, given that such models are derived based on a limited range of sediment concentration in rivers [30]. Empirical-based models provide reasonable accuracy for TSL estimation under the usual mild environmental conditions, the main concern is always associated with the extreme events that introduce high concentration of sediments into the rivers. Hydrology 2022, 9, 36 of the cases with high concentration of sediments, where the existing sediment estimation models usually have poor performance

Dimensional Analysis
Database
Development of the TSL Regression-Based Models
Experiments
Development of PCA-Based SVR Model for TSL Prediction
Statistical Measures
Pre-Processing Data Using PCA
PCA-Based MLR Results
PCA-Based SVR Results
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call