Abstract

The insurance claim is a basic problem in insurance companies. Insurance insurers always have a challenge to the growing of insurance claim loss. Because there is the occurrence of claim fraud and the volume of claim data increases in the insurance companies. As a result, it is difficult to classify the insured claim status during the claim review process. Therefore, the aims of the study was to build a machine learning model that classifies and make motor insurance claim status prediction in machine learning approach. To achieve this study Missing value ratio, Z- Score, encoding techniques and entropy were used as data set preparation techniques. The final preprocessed data sets split using K- Fold cross validation techniques into training and testing sets. Finally the prediction model was built using Random Forest (RF) and Multi Class –Support Vector Machine (SVM).The performance of the models, RF and Multi –Class SVM classifiers were evaluated using Accuracy, Precision, Recall, and F- measure. The prediction accuracy of the model is capable of predicting the motor insurance claim status with 98.36% and 98.17% by RF and SVM classifiers respectively. As a result, RF classifier is slightly better than Multi-Class Support vector machines. Developing and implementing hybrid model to benefit from the advantages of different algorithms having graphical user interface to apply the solution to real world problem of the insurance company is a pressing future work.

Highlights

  • Insurance company is fast growing, industry [1] [2]

  • What are the better classification techniques that would use for claim classification and how we evaluate the performance of the built machine learning model?

  • Classification is the most common type of problems [15], because of this there are evaluation metrics, which we used to evaluate the performance of the built machine learning models

Read more

Summary

Introduction

Insurance company is fast growing, industry [1] [2]. It has great role in assuring economic wellbeing of a country, and Insurance claims in insurance companies are costly problems [3]. Insurance companies have business problems, such as risk assessment, classification of policy holders and resource allocation, insurance claim classification and prediction in the insurance claim handling process [3]. This insurance business problems were not solved using traditional analytical approaches, including regression, linear programming [5]. There is a recent emphasis to use different sources, of data which extends beyond traditional data sources, often known as big data This big data has created to change data management across the insurance industry [7] [8]. Data variety and data volume push the traditional data management (Relational Database Management System (RDBMS) technologies and software tools because of their restrictions [7] [9]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.