Abstract

Credit-granting institutions need to estimate the probability of loan default, which represents the chance a customer fails to make repayments as promised. Critically this estimation is intertwined with the competing risk a customer fully repays their loan while also having key predictive drivers with values that change over time. A conventional model in this setting is a competing risks Cox Model with time-varying covariates. However partial likelihood estimation of this model has two shortcomings: (1) the baseline hazard is not estimated, so calculating probabilities requires a further estimation step; and (2) a covariance matrix for both regression coefficients and the baseline hazard is not produced. This paper caters for these shortcomings by devising a maximum likelihood technique to jointly estimate regression coefficients and the cause-specific baseline hazards using constrained optimisation to ensure the latter’s non-negativity. We show via simulation our technique produces regression coefficients estimates with lower bias in small samples with heavy censoring. When applied to a real-world credit risk dataset consisting of home loan data our Maximum Likelihood approach produces a smoother estimate of the cause-specific baseline hazards for default and redemption than those obtained using the Partial Likelihood and Breslow approach. This provides better clarity of the shape of these functions through both a less volatile central estimate as well as quantifying the error of this central estimate. We implement our method in R.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call