The semiparametric regression models have received a lot of attention from researchers recently because it combines parametric and nonparametric methods, it is one of the advanced topics in data analysis for various studies, which aims to find the best capabilities and a high level of efficiency. One of the most important semiparametric regression models is the partial linear regression model (PLM), which consists of a parametric component and a nonparametric component, for the purpose of estimating the parametric component, the difference method will be used to remove the nonparametric component. When the analysis hypotheses of the parametric component are not fulfilled, it will suffer from several problems, the most important of which is the problem of complete multicollinearity, besides the multicollinearity, there are also outliers in the data. In this research, the problems of multicollinearity and outliers of the semiparametric regression model were addressed, where simulation was used to generate data with different sample sizes and for different correlations and outlier ratios and for different methods such as [Difference Ridge based M robust with Nadaraya – Watson (DRMNW), Difference Ridge based S robust with Nadaraya – Watson (DRSNW), Difference Ridge based M robust with Smoothing spline (DRMSP), Difference Ridge based S robust with Smoothing spline (DRSSP)], the results showed that method Difference Ridge based M robust with Smoothing spline (DRMSP) is the best estimator.
Read full abstract