Cross-Validation With Confidence

Jing Lei

doi:10.1080/01621459.2019.1672556

Abstract

Cross-validation is one of the most popular model and tuning parameter selection methods in statistics and machine learning. Despite its wide applicability, traditional cross-validation methods tend to overfit, due to the ignorance of the uncertainty in the testing sample. We develop a novel statistically principled inference tool based on cross-validation that takes into account the uncertainty in the testing sample. This method outputs a set of highly competitive candidate models containing the optimal one with guaranteed probability. As a consequence, our method can achieve consistent variable selection in a classical linear regression setting, for which existing cross-validation methods require unconventional split ratios. When used for tuning parameter selection, the method can provide an alternative trade-off between prediction accuracy and model interpretability than existing variants of cross-validation. We demonstrate the performance of the proposed method in several simulated and real data examples. Supplemental materials for this article can be found online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cross-Validation With Confidence

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Journal: Journal of the American Statistical Association	Publication Date: Oct 31, 2019
Citations: 49

Similar Papers

Tuning Parameter Selection Based on Blocked $$3\times 2$$ Cross-Validation for High-Dimensional Linear Regression Model
Xingli Yang ... Ruibo Wang
Neural Processing Letters | VOL. 51
Xingli Yang, et. al.Xingli Yang ... Ruibo Wang
15 Oct 2019
Neural Processing Letters | VOL. 51

Sensors support machine learning
-
Food Science and Technology | VOL. 33
--
01 Dec 2019
Food Science and Technology | VOL. 33

MO360MACHINE LEARNING MODELS FOR PREDICTING ACUTE KIDNEY INJURY: A SYSTEMATIC REVIEW
Iacopo Vagliano ... Ameen Abu Hanna
Nephrology Dialysis Transplantation | VOL. 36
Iacopo Vagliano, et. al.Iacopo Vagliano ... Ameen Abu Hanna
29 May 2021
Nephrology Dialysis Transplantation | VOL. 36

Predicting seismic collapse probability of the building isolated with triple friction pendulums using machine learning
Yanqing Xu ... Ruijun Zhang
Structures | VOL. 58
Yanqing Xu, et. al.Yanqing Xu ... Ruijun Zhang
31 Oct 2023
Structures | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-Validation With Confidence

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association