Abstract

Fraud is an aggravating problem in the health insurance system, causing a substantial increase in the cost of medical services. Many models have been developed using data mining or machine learning techniques to lessen the impact of fraud on healthcare system. However, achieving good accuracy is still challenging as the claims data is multivariate with multiple class overlappings. In this paper, we propose a novel approach of unsupervised multivariate analysis for healthcare claims submitted by the providers. Our proposed model analyzes multivariate categorical data and continuous data in two stages to observe providers’ behaviors. The first stage constructs Weighted MultiTree (WMT) for categorical data to analyze similarity among provider profiles, the relation among profiles, and rendered services to identify false services. The second stage detects false claims by developing a univariate fraud detection model using different Density Based Clustering (DBC) techniques on continuous data of claims such as service counts and service charges. The performance of our proposed WMTDBC is measured by conducting experiments on the claims within various medical specialties or provider types of CMS part B program. Our empirical results evidence that the detection performance is enhanced with our WMTDBC approach when compared with the state-of-the-art models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.