Abstract
The bias of the empirical error rate in supervised classification is studied. It is shown that this bias can be understood as a covariance between the classification rule and the labeling of the training data. From this result, a new penalized criterion is proposed to perform model selection in classification. Applications of the resulting algorithm to simulated and real data are presented.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have