Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 2: Application

A Elshorbagy, D P Solomatine, S Srinivasulu, G Corzo

doi:10.5194/hess-14-1943-2010

Abstract

In this second part of the two-part paper, the data driven modeling (DDM) experiment, presented and explained in the first part, is implemented. Inputs for the five case studies (half-hourly actual evapotranspiration, daily peat soil moisture, daily till soil moisture, and two daily rainfall-runoff datasets) are identified, either based on previous studies or using the mutual information content. Twelve groups (realizations) were generated from each dataset by randomly sampling without replacement from the original dataset. Artificial neural networks (ANNs), genetic programming (GP), evolutionary polynomial regression (EPR), support vector machines (SVM), M5 model trees (M5), K-nearest neighbors (K-nn), and multiple linear regression (MLR) techniques are implemented and applied to each of the 12 realizations of each case study. The predictive accuracy and uncertainties of the various techniques are assessed using multiple average overall error measures, scatter plots, frequency distribution of model residuals, and the deterioration rate of prediction performance during the testing phase. The Gamma test is used as a guide to assist in selecting the appropriate modeling technique. The results of the experiment show that ANNs were a sub-optimal choice for the actual evapotranspiration and the two rainfall-runoff case studies, unlike the two nonlinear soil moisture case studies. GP is the most successful technique due to its ability to adapt the model complexity to the modeled data. EPR performance could be close to GP with datasets that are more linear than nonlinear. SVM is sensitive to the kernel choice, and its performance can improve if the kernel is appropriately selected. M5 performs very well with linear and semi-linear data, which cover a wide range of hydrological situations. In highly nonlinear case studies, ANNs, K-nn, and GP could be more successful than other modeling techniques. K-nn is also successful in linear situations, and it should not be ignored as a potential modeling technique for hydrological applications.
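
The realization step described in the abstract can be illustrated with a minimal sketch. It assumes each realization is a random partition of the records into a training and a testing subset; the function name `make_realizations`, the 50/50 split fraction, and the integer stand-in data are hypothetical and are not taken from the paper.

```python
import random

def make_realizations(records, n_realizations=12, train_frac=0.5, seed=0):
    """Return (train, test) splits, each drawn without replacement.

    train_frac and seed are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    n_train = int(len(records) * train_frac)
    realizations = []
    for _ in range(n_realizations):
        # sample() with k == len(records) yields a permutation,
        # so no record appears twice within one realization.
        shuffled = rng.sample(records, k=len(records))
        realizations.append((shuffled[:n_train], shuffled[n_train:]))
    return realizations

records = list(range(100))  # stand-in for the observations of one case study
splits = make_realizations(records)
```

Each of the 12 splits uses every record exactly once, so the subsets of a single realization never overlap, while different realizations partition the same dataset differently.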

Highlights

The research methodology explained in the first part of this two-part companion paper was implemented in the sequence presented earlier.
It is certainly useful to judge techniques based on the range of performances; if a single value is needed, one has to rely on the average performance.
If a technique is better than the rest with respect to two different error measures (e.g., RMSE and R), this can be a strong indication of the superiority of such a technique.
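
As a rough illustration of the last two highlights, the sketch below computes RMSE and the correlation coefficient R, summarizes a set of per-realization scores by range and average, and checks whether one technique beats another on both measures. The helper names (`summarize`, `better_on_both`) are hypothetical, and the comparison uses mean scores as an assumed reading of "average performance".

```python
import math
import statistics

def rmse(obs, pred):
    """Root mean squared error between observed and predicted values."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def pearson_r(obs, pred):
    """Pearson correlation coefficient R between observed and predicted values."""
    mo, mp = statistics.fmean(obs), statistics.fmean(pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    so = math.sqrt(sum((o - mo) ** 2 for o in obs))
    sp = math.sqrt(sum((p - mp) ** 2 for p in pred))
    return cov / (so * sp)

def summarize(scores):
    """Range and average of one error measure across realizations."""
    return {"min": min(scores), "max": max(scores), "mean": statistics.fmean(scores)}

def better_on_both(rmse_a, r_a, rmse_b, r_b):
    """True if technique A has a lower mean RMSE and a higher mean R than B."""
    return (statistics.fmean(rmse_a) < statistics.fmean(rmse_b)
            and statistics.fmean(r_a) > statistics.fmean(r_b))
```

In use, `rmse` and `pearson_r` would be evaluated once per realization on the testing subset, and the resulting lists passed to `summarize` and `better_on_both`.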

Summary

Introduction

The research methodology explained in the first part of this two-part companion paper was implemented in the sequence presented earlier. Inputs of the various models were identified. A mixed approach of input selection was adopted since identification of optimum inputs was not in itself one of the objectives of this study. The section describes the five different datasets. The two soil moisture datasets (Elshorbagy and Parasuraman, 2008) and a reduced hourly version of the evapotranspiration (AET) dataset (Parasuraman and Elshorbagy, 2008; Parasuraman et al., 2007) were used in earlier studies. This study benefited from the input structure identified in the earlier studies, and sometimes (e.g., the case of the evapotranspiration dataset) enhanced the input structure by considering more inputs identified using the mutual information content.
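
To make the idea of screening inputs by mutual information content concrete, here is a minimal sketch of a histogram-based estimate of the mutual information between a candidate input and the target. The equal-width binning, the default of four bins, and the function name `mutual_information` are assumptions made for illustration; the text above does not state which estimator the study used.

```python
import math
from collections import Counter

def mutual_information(x, y, bins=4):
    """Histogram estimate of I(X;Y) in nats, using equal-width bins (an assumption)."""
    def discretize(values):
        lo, hi = min(values), max(values)
        width = (hi - lo) / bins or 1.0  # guard against a constant series
        return [min(int((v - lo) / width), bins - 1) for v in values]

    bx, by = discretize(x), discretize(y)
    n = len(x)
    pxy = Counter(zip(bx, by))      # joint bin counts
    px, py = Counter(bx), Counter(by)  # marginal bin counts
    return sum(
        (c / n) * math.log((c / n) / ((px[i] / n) * (py[j] / n)))
        for (i, j), c in pxy.items()
    )
```

Candidate inputs could then be ranked by their estimated mutual information with the target, and those with the highest values retained. A candidate that carries no information about the target scores zero, while one that determines the target exactly scores the entropy of the binned target.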

Journal: Hydrology and Earth System Sciences
Publication Date: Oct 14, 2010
Citations: 142
License type: CC BY 3.0

Similar Papers

Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology
A Elshorbagy ... S Srinivasulu
Hydrology and Earth System Sciences | VOL. 14
14 Oct 2010

Development of an agricultural drought assessment technique using SMAP soil moisture images
Yongchul Shin ... Kyung-Sook Choi
Journal of The Korean Society of Agricultural Engineers | VOL. 59
31 Jan 2017

Reliability assessment of water quality index based on guidelines of national sanitation foundation in natural streams: integration of remote sensing and data-driven models
Mohammad Najafzadeh ... Hadi Farhadi
Artificial Intelligence Review | VOL. 54
24 Apr 2021

Comparison of three data-driven techniques in modelling the evapotranspiration process
I El-Baroudy ... D Savic
Journal of Hydroinformatics | VOL. 12
26 Mar 2010
