Abstract

Regression models with change-points have been widely applied in various fields. Most methodologies for change-point regressions assume Gaussian errors. For many real data having longer-than-normal tails or atypical observations, the use of normal errors may unduly affect the fit of change-point regression models. This paper proposes two robust algorithms called EMT and FCT for change-point regressions by incorporating the t-distribution with the expectation and maximization algorithm and the fuzzy classification procedure, respectively. For better resistance to high leverage outliers, we introduce a modified version of the proposed method, which fits the t change-point regression model to the data after moderately pruning high leverage points. The selection of the degrees of freedom is discussed. The robustness properties of the proposed methods are also analyzed and validated. Simulation studies show the effectiveness and resistance of the proposed methods against outliers and heavy-tailed distributions. Extensive experiments demonstrate the preference of the t-based approach over normal-based methods for better robustness and computational efficiency. EMT and FCT generally work well, and FCT always performs better for less biased estimates, especially in cases of data contamination. Real examples show the need and the practicability of the proposed method.

Highlights

  • These facts illustrate that FCT with DF equal to 1 (FCT1) and FCT1tm are more resistant against unusual observations; especially, FCT1-tm is capable of withstanding high leverage outliers

  • This paper proposed a new robust method for CP regressions using t-distributions

  • The t-distribution provides a longer tailed substitute to the normal distribution and has an additional parameter called degrees of freedom (DF), which is critical for tuning robustness

Read more

Summary

Introduction

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. Fearnhead and Rigaill [6] provided a real example to illustrate the CP detection errors caused by outliers They remarked that many CP methods are sensitive to outliers because of the modeling assumption of Gaussian noises, with much work on CP regressions being carried out for Gaussian models, such as in [7,8,9,10,11]. The longer tail of a t-distribution makes it a more robust approach to CP regression data fitting by giving less weight to observations that are atypical in the computation of CPs and regression parameters.

Related Work
An EMT Algorithm for Change-Point Regression Models
A FCT Algorithm for t CP Regression Models
Decide the Degrees of Freedom of t-Distributions
Method
Models usedused to show
Resistance against High Leverage Outliers
The Robust Properties of EMT and FCT
Simulation Studies
Fitting
Discussion
Findings
Models
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call