Abstract
We propose a measure for assessing the influence of deleting a single observation in linear regression. The measure is derived from a geometric view of the sampling distribution of the distance between two estimators of the regression coefficients, one computed with and one without the specific observation. The covariance matrix of this sampling distribution plays a key role in the derivation. Geometrically, the distance is distributed entirely along the axis associated with the nonnull eigenvalue of the covariance matrix. The deviation of the regression coefficients computed without an observation from those computed with the full data is reflected in this eigenvalue, which can therefore be used to investigate influence. Normalizing the distance by the associated covariance matrix yields the square of the internally studentized residual. Illustrative examples demonstrate the effectiveness of the proposed influence measure. In judging the influence of observations on the least squares estimates of the regression coefficients, Cook's distance does not work well in one of the examples, so it should not be used blindly.
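The geometric claims in the abstract can be checked numerically. The sketch below is not the authors' code. It assumes the standard case-deletion identity, under which the difference between the full-data and deleted-case estimates has covariance σ² (XᵀX)⁻¹ xᵢ xᵢᵀ (XᵀX)⁻¹ / (1 − hᵢᵢ), a rank-one matrix. It also assumes that the unknown σ² is replaced by the usual residual mean square. The simulated data, the variable names, and the choice of deleted index are purely illustrative.

```python
import numpy as np

# Simulated data (illustrative assumption, not from the paper).
rng = np.random.default_rng(0)
n, p = 20, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
y = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=n)

i = 4  # index of the observation to delete (arbitrary choice)

# Full-data least squares fit.
XtX_inv = np.linalg.inv(X.T @ X)
beta_full = XtX_inv @ X.T @ y
resid = y - X @ beta_full
s2 = resid @ resid / (n - p)      # residual mean square, plug-in for sigma^2
h_ii = X[i] @ XtX_inv @ X[i]      # leverage of observation i

# Refit without observation i and take the difference of the estimates.
X_del = np.delete(X, i, axis=0)
y_del = np.delete(y, i)
beta_del = np.linalg.lstsq(X_del, y_del, rcond=None)[0]
d = beta_full - beta_del

# Estimated covariance matrix of the difference (rank one by construction).
a = XtX_inv @ X[i]
V = s2 * np.outer(a, a) / (1.0 - h_ii)

# eigh returns eigenvalues in ascending order; the last one is the nonnull one.
eigvals, eigvecs = np.linalg.eigh(V)
lam, v = eigvals[-1], eigvecs[:, -1]

# Distance normalized along the single non-degenerate axis.
normalized = (v @ d) ** 2 / lam

# Internally studentized residual of observation i.
r_int = resid[i] / np.sqrt(s2 * (1.0 - h_ii))

print("nonnull eigenvalue:", lam)
print("normalized distance:", normalized)
print("squared internally studentized residual:", r_int ** 2)
```

Under these assumptions the last two printed values should agree up to floating-point error, and the difference vector `d` should lie along the eigenvector `v`. Projecting onto `v` and dividing by `lam` is equivalent to a quadratic form in the Moore-Penrose inverse of `V`, because `V` has only one nonzero eigenvalue.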