Cross-validation is dead. Long live cross-validation! Model validation based on resampling

Knut Baumann

doi:10.1186/1758-2946-2-s1-o5

Abstract

Cross-validation was originally invented to estimate the prediction error of a mathematical modelling procedure. It can be shown that cross-validation estimates the prediction error almost without bias. Nonetheless, there are numerous reports in the chemoinformatic literature that cross-validated figures of merit cannot be trusted and that a so-called external test set has to be used to estimate the prediction error of a mathematical model. In most cases where cross-validation fails to estimate the prediction error correctly, the failure can be traced back to the fact that cross-validation was employed as an objective function for model selection. Typically, each model has some meta-parameters that need to be tuned, such as the choice of the actual descriptors and the number of variables in a QSAR equation, the network topology of a neural net, or the complexity of a decision tree. In this case the meta-parameter is varied and the cross-validated prediction error is determined for each setting. Finally, the parameter setting that optimizes the cross-validated prediction error is chosen in an attempt to optimize the predictivity of the model. However, in these cases cross-validation is no longer an unbiased estimator of the prediction error and may deviate grossly from the result of an external test set. It can be shown that the amount of model selection is directly related to the inflation of cross-validated figures of merit. Hence, the model selection step has to be separated from the step of estimating the prediction error. If this is done correctly, cross-validation (or resampling in general) retains its property of estimating the prediction error without bias. In fact, it can be shown that splitting the data into a training set and an external test set often estimates the prediction error less precisely than proper cross-validation. It is this variability of prediction errors, which depends on test set size, that causes seemingly paradoxical phenomena such as the so-called Kubinyi's paradox for small data sets.
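
The separation of model selection from error estimation described in the abstract can be sketched with a small simulation. This is a minimal illustration and not the author's implementation: the data are simulated pure noise, the "model" is a one-descriptor least-squares line, the meta-parameter is the choice of descriptor, and all function names (`fit_line`, `folds`, `cv_mse`, `select_descriptor`, `naive_cv`, `double_cv`, `simulate`) and the default sizes are assumptions made for this example.

```python
import random


def fit_line(x, y):
    """Least-squares fit of y = a + b*x; returns (a, b)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx if sxx > 0 else 0.0
    return my - b * mx, b


def folds(indices, k):
    """Yield k interleaved (train, test) splits of a list of sample indices."""
    for f in range(k):
        test = indices[f::k]
        train = [i for pos, i in enumerate(indices) if pos % k != f]
        yield train, test


def cv_mse(X, y, j, indices, k=5):
    """k-fold cross-validated MSE of the one-descriptor model on column j."""
    sq_err = 0.0
    for train, test in folds(indices, k):
        a, b = fit_line([X[i][j] for i in train], [y[i] for i in train])
        sq_err += sum((y[i] - (a + b * X[i][j])) ** 2 for i in test)
    return sq_err / len(indices)


def select_descriptor(X, y, indices, k=5):
    """Model selection: pick the descriptor with the lowest CV error."""
    return min(range(len(X[0])), key=lambda j: cv_mse(X, y, j, indices, k))


def naive_cv(X, y, k=5):
    """Report the same CV error that served as the selection criterion."""
    idx = list(range(len(y)))
    best = select_descriptor(X, y, idx, k)
    return cv_mse(X, y, best, idx, k)


def double_cv(X, y, k=5):
    """Outer loop estimates the error; selection is redone in each outer training set."""
    idx = list(range(len(y)))
    sq_err = 0.0
    for train, test in folds(idx, k):
        best = select_descriptor(X, y, train, k)  # inner CV never sees the test fold
        a, b = fit_line([X[i][best] for i in train], [y[i] for i in train])
        sq_err += sum((y[i] - (a + b * X[i][best])) ** 2 for i in test)
    return sq_err / len(idx)


def simulate(n=30, p=100, reps=10, seed=0):
    """Average both estimates over several pure-noise data sets (true variance 1)."""
    rng = random.Random(seed)
    naive, nested = [], []
    for _ in range(reps):
        X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
        y = [rng.gauss(0, 1) for _ in range(n)]  # response unrelated to any descriptor
        naive.append(naive_cv(X, y))
        nested.append(double_cv(X, y))
    return sum(naive) / reps, sum(nested) / reps


if __name__ == "__main__":
    naive, nested = simulate()
    print(f"selection-biased CV MSE: {naive:.2f}")
    print(f"double CV MSE:           {nested:.2f}")
```

Because the response is unrelated to every descriptor, no model can predict better than the noise variance of 1. Under these assumptions, the error reported by `naive_cv` should come out lower than that of `double_cv`, since the best of many chance correlations is selected and then scored on the same folds, whereas `double_cv` scores each held-out fold with a descriptor chosen without it.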

Journal: Journal of Cheminformatics
Publication Date: May 1, 2010
Citations: 7
License type: CC BY-NC 2.0

Similar Papers

Reliable estimation of externally validated prediction errors for QSAR models
Désirée Baumann ... Knut Baumann
01 Mar 2013
Journal of Cheminformatics | VOL. 5

Robust preprocessing and model selection for spectral data
Sabine Verboven ... Mia Hubert
07 May 2012
Journal of Chemometrics | VOL. 26

Reliable estimation of prediction errors for QSAR models under model uncertainty using double cross-validation.
Désirée Baumann ... Knut Baumann
26 Nov 2014
Journal of Cheminformatics | VOL. 6

Multitask Deep Learning-Based Whole-Process System for Automatic Diagnosis of Breast Lesions and Axillary Lymph Node Metastasis Discrimination from Dynamic Contrast-Enhanced-MRI: A Multicenter Study.
... Jing Gao
27 Jul 2023
Journal of magnetic resonance imaging : JMRI | VOL. 59
