Abstract

Scatterplot smoothers estimate a regression function y = f(x) by local averaging of the observed data points (x i , y i ). In using a smoother, the statistician must choose a window width, a crucial smoothing parameter that says just how locally the averaging is done. This paper concerns the data-based choice of a smoothing parameter for splinelike smoothers, focusing on the comparison of two popular methods, C p and generalized maximum likelihood. The latter is the MLE within a normal-theory empirical Bayes model. We show that C p is also maximum likelihood within a closely related nonnormal family, both methods being examples of a class of selection criteria. Each member of the class is the MLE within its own one-parameter curved exponential family. Exponential family theory facilitates a finite-sample nonasymptotic comparison of the criteria. In particular it explains the eccentric behavior of C p , which even in favorable circumstances can easily select small window widths and wiggly estimates of f(x). The theory leads to simple geometric pictures of both C p and MLE that are valid whether or not one believes in the probability models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call