In insurance, proposing an accurate premium that is adjusted to the insured risk profile allows companies to better manage their portfolios and to be more competitive. Machine learning methods have recently been adopted for various improvements in insurance ratemaking, especially in the automobile industry. These models are specifically used to mine potential data information and to build a predictive model for a variable of interest using explanatory variables. In this paper, we aim to provide a pricing method for ratemaking individual healthcare insurance contracts using machine learning algorithms that are applied to a Tunisian healthcare insurance portfolio. We start with a simple Classification and Regression Tree, and we work toward more advanced methods that are Random Forest, Extreme gradient boosting, Support Vector Regression, and Artificial Neural network regression model. The predictive performance of these non-parametric methods is compared with the standard generalized linear model. Our results showed the applicability of machine learning in the healthcare insurance market and that the XGBoost algorithm outperforms the predictive capacity of the classical generalized linear model.
Read full abstract