Abstract

We consider a k-nearest neighbor-based nonparametric lack-of-fit test of constant regression in presence of heteroscedastic variances. The asymptotic distribution of the test statistic is derived under the null and local alternatives for a fixed number of nearest neighbors. Advantages of our test compared to classical methods include: (1) The response variable can be discrete or continuous regardless of whether the conditional distribution is symmetric or not and can have variations depending on the predictor. This allows our test to have broad applicability to data from many practical fields; (2) this approach does not need nonlinear regression function estimation that often affects the power for moderate sample sizes; (3) our test statistic achieves the parametric standardizing rate, which gives more power than smoothing-based nonparametric methods for moderate sample sizes. Our numerical simulation shows that the proposed test is powerful and has noticeably better performance than some well known tests when the data were generated from high frequency alternatives or binary data. The test is illustrated with an application to gene expression data and an assessment of Richards growth curve fit to COVID-19 data.

Highlights

  • Zimmermann and Xin GaoNonparametric lack-of-fit tests where the constant regression is assumed for the null hypothesis have been considered by many authors

  • We derived the asymptotic distribution of a nonparametric lack-of-fit test of constant regression in the presence of heteroscedastic variances

  • We considered a test statistic obtained using the augmentation of a small number of k-nearest neighbors defined through the ranks of the predictor variable

Read more

Summary

Introduction

Nonparametric lack-of-fit tests where the constant regression is assumed for the null hypothesis have been considered by many authors. Hart [2] extended the order selection method of Reference [1] to rank-based test under the constant variance assumption so that the test statistic is relatively insensitive to misspecification of distributional assumptions. These two order selection tests show excellent performance under low frequency alternatives. We consider a nonparametric lack-of-fit test of constant regression in presence of heteroscedastic variances This test has better power for data from high frequency alternatives than the four tests reviewed above.

The Hypotheses and Test Statistic
Asymptotic Distribution of the Test Statistic under the Null Hypothesis
Results under Local or Fixed Alternatives
Selection of the Number of Nearest Neighbors
Monte Carlo Simulation Studies
Applications to Real Data
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call