Sequence Embeddings Help Detect Insurance Fraud

Ivan Fursov, Elizaveta Kovtun, Rasul Khasyanov, Rodrigo Rivera-Castro, Alexey Zaytsev, Evgeny Burnaev, Martin Spindler

doi:10.1109/access.2022.3149480

Abstract

Roughly 10 percent of the insurance industry’s incurred losses are estimated to stem from fraudulent claims. One solution is to use tabular data to construct models that can distinguish between claims that are legitimate and those that are fraudulent. However, while canonical tabular data models enable robust fraud detection, complex sequential data have been out of the insurance industry’s scope. For health insurance, we propose deep learning architectures that process insurance data consisting of sequential records of patient visits and characteristics. Both the sequential and tabular components improve the quality of the model, generating new insights into the detection of health insurance fraud. Empirical results derived using relevant data from a health insurance company show that our approach outperforms state-of-the-art models and can substantially improve the claims management process. We obtain a ROC AUC of 0.873, while the best competitor based on state-of-the-art models achieves 0.815. Moreover, we demonstrate that our architectures are more robust to data corruption. As more and more semi-structured event sequence data become available to insurers, our methods will be valuable for many similar applications, particularly when variables have a large number of categories, such as those from the International Classification of Diseases (ICD) codes or other classification codes.
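For readers unfamiliar with the ROC AUC metric reported in the abstract, the following is a minimal sketch of how it can be computed from labels and model scores. It uses the pairwise (rank-based) definition: the fraction of fraudulent/legitimate claim pairs in which the fraudulent claim receives the higher score. The function name `roc_auc` and the toy data are illustrative assumptions and are not taken from the paper.

```python
def roc_auc(labels, scores):
    """Pairwise ROC AUC: probability that a randomly chosen positive
    (label 1) scores higher than a randomly chosen negative (label 0).
    Ties count as half a win. Hypothetical helper, not the authors' code."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example (made-up scores): 1 = fraudulent claim, 0 = legitimate claim.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)
```

The double loop is quadratic in the number of claims, which is fine for illustration; a sort-based computation would be preferable on real data.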

Highlights

Fraud causes substantial costs and losses for the finance and insurance industries
We propose architectures for categorical sequence embeddings via deep learning that help improve the classification of fraudulent and valid claims compared to other machine learning methods
Results: We evaluate the metrics of our classic machine learning and deep learning models

Summary

Introduction

Fraud causes substantial costs and losses for the finance and insurance industries. Examples include fraudulent credit card transactions and insurance fraud. Fraud detection is a critical function and core competence in these industries and their claims management processes. The proliferation of digitization in finance and insurance has led to big datasets suited to fraud detection. We propose architectures for categorical sequence embeddings via deep learning that help improve the classification of fraudulent and valid claims compared to other machine learning methods. We review different ways to cope with the class imbalance problem that is typical for fraud detection. Researchers from many disciplines investigate it for application domains including time-series modeling [2], [3], predictive maintenance of technical systems [4], [5], and applications in the finance and insurance industries [6].
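As an illustration of the class imbalance problem mentioned above, here is a minimal sketch of one common remedy: weighting each class inversely to its frequency so that rare fraudulent claims contribute as much to the training loss as the many legitimate ones. This is only one of several possible approaches and is not necessarily the one the authors use; the function name `balanced_class_weights` and the toy labels are assumptions made for this example.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Inverse-frequency class weights: n_samples / (n_classes * class_count).
    Hypothetical helper for illustration, not taken from the paper."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


# Toy data (made up): 9 legitimate claims (0) and 1 fraudulent claim (1).
toy_claims = [0] * 9 + [1]
weights = balanced_class_weights(toy_claims)
# The rare class receives the larger weight.
```

Such weights would typically be passed to a weighted loss function during training; resampling the minority class is an alternative with a similar goal.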

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 6	License type: CC BY 4.0

Similar Papers

Mediclaim Fraud Detection and Management Using Predictive Analytics
M R Sumalatha ... M Prabha
01 Dec 2019

Towards Better Detection of Fraud in Health Insurance Claims in Kenya: Use of Naïve Bayes Classification Algorithm
Christopher A Moturi ... Sharifa R Mambo
East African Journal of Information Technology | VOL. 5
23 Dec 2022

Fraud detection and frequent pattern matching in insurance claims using data mining techniques
Aayushi Verma ... Anuja Arora
01 Aug 2017

Research on the risk governance of fraudulent reimbursement of patient consultation fees.
Jiangjie Sun ... Yue Wang
Frontiers in public health | VOL. 12
12 Feb 2024
