Abstract

The bias of the empirical error rate in supervised classification is studied. It is shown that this bias can be understood as a covariance between the classification rule and the labeling of the training data. From this result, a new penalized criterion is proposed to perform model selection in classification. Applications of the resulting algorithm to simulated and real data are presented.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call