Abstract

Statistical models play an important role in elucidating the dynamic behaviors of surface water quality, given limited data on a large scale. In this study, we examine alternative approaches to develop regression models that predict fecal coliform (FC) concentrations in a river using different methods for selecting important variables provided by a self-organizing map (SOM). The raw data used as input to the SOM included 11 water quality, 6 meteorological, and 7 land use parameters that were monitored along the Yeongsan River in Korea on various time scales (from daily to half a decade) during 1996–2008. In both test and validation data sets, (multiple) regressions using backward elimination were compared against regression models via forced entry, which included a set of ranked variables simultaneously based on four indices in the SOM (i.e. structuring index, relative importance, cluster description, and Spearman’s rank correlation). Results showed that the SOM effectively illustrated the complex relationship between FC and the remaining variables in the entire data set. This relationship was seen more clearly in homogeneous clusters, indicating that the regression models became more robust in each subdivided group. While the original backward elimination model (R2 = 0.66) had much better performance than the models with four indices (R2 = 0.40–0.45) in the test data set, its performance (R2 = 0.42) was quite comparable to the relative importance model (R2 = 0.38) in the validation data set. Based on this preliminary study, we recommend further investigation of these indices for a reliable regression analysis, as the t values currently used for the variable selection in regressions provide only a locally optimal solution for the final model. The proposed methodology, if verified successfully, would be useful in developing early warning models that control mortality or disease rates of fishes in high-density aquafarms via water quality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.