Abstract

Nonparametric regression gives better flexibility because the form of the regression curve estimation adjusts to the data. One nonparametric regression method is spline truncation. The number of knots and their locations affect the form of this regression curve estimation. The optimal knot is needed in order to obtain the best model. There are methods to select optimal knots, such as unbiased risk (UBR) and cross-validation (CV). This paper discusses UBR and CV, then, using both simulated data and the unemployment rate data of Central Java Province, Indonesia, in 2015, compares UBR and CV for selecting the optimal knots. The criteria for selecting the best model were based on Mean Squared Error and R-square. The simulation was performed on a spline truncated function with error generated from normal distribution for varied sample sizes and error variance. The results of the simulation study showed that CV estimates the knots more accurately than UBR. From the application to unemployment rate data, the optimal knot by using CV was a combination of 2-3-2-1-3 knot with MSE of 0.3946 and R-square of 93.047%. Meanwhile, by using UBR, the optimal knot was a three knot with MSE of 0.6865 and R-square of 90.59%. In conclusion, from the results of simulation data and application to unemployment rate data, the CV method generated a better model than the UBR method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call