Abstract

This paper studies the problem of local bandwidth selection for local linear regression. It is known that the optimal local bandwidth for estimating the unknown curve f at design point x depends on the curve’s second derivative f''(x) at x. Therefore one could select the local bandwidth h(x) at x via estimating f''(x). However, as typically estimating f''(x) is a much harder task than estimating f(x) itself, this approach for choosing h(x) tends to produce less accurate results. This paper proposes a method for choosing h(x) that bypasses the estimation of f''(x), yet at the same time utilizes the useful fact that the optimal local bandwidth depends on f''(x). The main idea is to first partition the domain of f(x) into different segments for which the second derivative of each segment is approximately constant. The number and the length of the segments are assumed unknown and will be estimated. Then, after such a partition is obtained, any reliable, well-studied global bandwidth selection method can be applied to choose the bandwidth for each segment. The empirical performance of the proposed local bandwidth selection method is evaluated by numerical experiments.

Highlights

  • Local linear regression is a popular method for nonparametric curve estimation

  • If the curve demonstrates a large amount of spatial inhomogeneities, local bandwidth smoothing, sometimes known as variable bandwidth smoothing, should be used

  • In this article a method is proposed for choosing the bandwidth function for local linear smoothing

Read more

Summary

Introduction

Local linear regression is a popular method for nonparametric curve estimation. An important aspect in its implementation is the choice for the amount of smoothing; i.e., the selection of the so-called bandwidth. The step is to calculate a single (global) bandwidth for each segment These bandwidths are joined together to form a piecewise constant function h(x); see Figure 1(c). This final bandwidth function is used to estimate the unknown curve.

Background
Second differencing
Second derivative segmentation using minimum description length
Partial local smoothing
Summary
Theoretical properties
Simulation results
Regularly spaced data
Non-uniform design densities
Real data
Concluding remarks
The banded Toeplitz matrix V
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call