Model Performance Uncertainty Research Articles

This paper presents first steps toward robust models for crisis prediction. We conduct a horse race of conventional statistical methods and more recent machine learning methods as early-warning models. As individual models are in the literature most often built in isolation of other methods, the exercise is of high relevance for assessing the relative performance of a wide variety of methods. Further, we test various ensemble approaches to aggregating the information products of the built models, providing a more robust basis for measuring country-level vulnerabilities. Finally, we provide approaches to estimating model uncertainty in early-warning exercises, particularly model performance uncertainty and model output uncertainty. The approaches put forward in this paper are shown with Europe as a playground. Generally, our results show that the conventional statistical approaches are outperformed by more advanced machine learning methods, such as k-nearest neighbors and neural networks, and particularly by model aggregation approaches through ensemble learning.

A stormwater quality model should be calibrated and verified against available data before it can be confidently used. This paper mainly examines two questions: how do the size and selection of calibration data sets affect model performances and how should the calibration data sets be selected. Regression models are used to simulate stormwater quality (TSS and COD) with variables characterizing rainfall and flow characteristics. Based on large databases of three catchments in France, several models are calibrated and verified with different data subsets. It is confirmed that the selection of calibration data sets leads to significant uncertainty in model performance. The information content in the calibration data sets is also important in addition to their size. Generally model performances can be improved by using a large size of calibration data sets and by selecting calibration data that are representative of all data. Three methods endeavoring to improve model performance by selecting calibration data either according to model outputs or model inputs are developed based on the principle of choosing calibration data that are representative of the whole data set. The effectiveness of the three selection methods is demonstrated by their application on databases of the three catchments. Model performances can be generally improved by selection methods. The selection methods based on model inputs that consider multi-dimension information perform better than the method with one-dimension information consideration.

Model Performance Uncertainty Research Articles

Related Topics

Articles published on Model Performance Uncertainty

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction

Toward robust early-warning models: a horse race, ensembles and model uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

On calibration data selection: The case of stormwater quality regression models

Evaluation of past and potential phosphorus uptake at the Orlando Easterly Wetland

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Model Performance Uncertainty Research Articles

Related Topics

Articles published on Model Performance Uncertainty

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction

Toward robust early-warning models: a horse race, ensembles and model uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

Toward Robust Early-Warning Models: A Horse Race, Ensembles and Model Uncertainty

On calibration data selection: The case of stormwater quality regression models

Evaluation of past and potential phosphorus uptake at the Orlando Easterly Wetland