Abstract

We occasionally find that a small subset of the data exerts a disproportionate influence on the fitted regression model. We would like to locate these influential points and assess their impact on the model. However, the existence of influential data is complicated by the presence of collinearity (see, e.g. [15]). In this article we develop a new influence statistic for one or a set of observations in linear regression dealing with collinearity. We show that this statistic has asymptotically normal distribution and is able to detect a subset of high ridge leverage outliers. Using this influence statistic we also show that when ridge regression is used to mitigate the effects of collinearity, the influence of some observations can be drastically modified. As an illustrative example, simulation studies and a real data set are analysed.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call