Abstract

Tax evasion refers to an entity indulging in illegal activities to avoid paying their actual tax liability. A tax return statement is a periodic report comprising information about income, expenditure, etc. One of the most basic tax evasion methods is failing to file tax returns or delay filing tax return statements. The taxpayers who do not file their returns, or fail to do so within the stipulated period are called tax return defaulters. As a result, the Government has to bear the financial losses due to a taxpayer defaulting, which varies for each taxpayer. Therefore, while designing any statistical model to predict potential return defaulters, we have to consider the real financial loss associated with the misclassification of each individual. This paper proposes a framework for an example-dependent cost-sensitive stacking classifier that uses cost-insensitive classifiers as base generalizers to make predictions on the input space. These predictions are used to train an example-dependent cost-sensitive meta generalizer. Based on the meta-generalizer choice, we propose four variant models used to predict potential return defaulters for the upcoming tax-filing period. These models have been developed for the Commercial Taxes Department, Government of Telangana, India. Applying our proposed variant models to GST data, we observe a significant increase in savings compared to conventional classifiers. Additionally, we develop an empirical study showing that our approach is more adept at identifying potential tax return defaulters than existing example-dependent cost-sensitive classification algorithms.

Highlights

  • Taxes can be classified into direct taxes, which are payable directly to the government

  • We show that our Proposed Approach (PA) is adept at identifying potential tax return defaulters for the upcoming month with high accuracy

  • We propose a framework for example-dependent cost-sensitive stacked generalization comprising four variant models

Read more

Summary

Introduction

Taxes can be classified into direct taxes, which are payable directly to the government (Eg. Income tax). These taxes cannot be transferred to any other third party, and indirect taxes, which can be shifted to a third party by the entity that is levied the tax (Eg. VAT, excise duty). The Goods and Services Tax (GST) system is an indirect taxation system introduced in India in July 2017. This paper proposes a methodology to predict potential tax return defaulters for the GST system [1]

Working of the GST system
Motivation for this work
Related Work
Data Description and Feature Extraction
GSTR-1 Data
Monthly GST Returns Data
Creation of Network of taxpayers
Feature Extraction
Division-Name
Seasonality
Framework for Example Dependent Stacking Classifier
Stacked Generalizers and General Framework
Cost function
Variant A
Variant B
Variant C
Variant D
Cost Matrix
Performance of Proposed Variants
F1-Score
Confusion and Cost Matrices
Training and Testing ROC Curves
Savings score
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call