This article discusses the innovation of statistical modeling in regression analysis with a semiparametric approach applied to health data. Regression analysis is a method in statistics that takes a lot of roles in statistical modeling. Regression analysis is used to model the relationship between the independent variable (x) and the dependent variable (y). There are three approaches to regression analysis, namely parametric, nonparametric, and a combination of the two, namely semiparametric. Semiparametric regression is used when the dependent variable has a known relationship with some of the independent variables and has an unknown pattern of a relationship with some of the other independent variables. The purpose of this study was to model hemoglobin levels in dengue fever patients, with the independent variables used being the number of hematocrits (x1) and the number of leukocytes (x2). The method used is spline truncated semiparametric regression. The truncated spline estimator was chosen for the nonparametric component because it has many advantages in modeling, one of which is being able to model patterns where the form of the relationship is unknown. The parameter estimation used is the maximum estimation. Selection of the optimal knot point using Generalized Cross-Validation (GCV). Based on the results of the analysis, the truncated spline semiparametric regression model was obtained which was applied to the hemoglobin level data in a model with three knots which have a coefficient of determination of 89.074%. Based on the results of testing the hypothesis simultaneously, it can be concluded that simultaneously the independent variable has a significant effect on the dependent variable. In the partial test, it is concluded that the variables x1 and x2 have a significant influence on the dependent variable y .
Read full abstract