RR-PU: A Synergistic Two-Stage Positive and Unlabeled Learning Framework for Robust Tax Evasion Detection

Shuzhi Cao,Bo Dong,Qinghua Zheng,Bin Shi,Jianfei Ruan

doi:10.1609/aaai.v38i8.28665

Abstract

Tax evasion, an unlawful practice in which taxpayers deliberately conceal information to avoid paying tax liabilities, poses significant challenges for tax authorities. Effective tax evasion detection is critical for assisting tax authorities in mitigating tax revenue loss. Recently, machine-learning-based methods, particularly those employing positive and unlabeled (PU) learning, have been adopted for tax evasion detection, achieving notable success. However, these methods exhibit two major practical limitations. First, their success heavily relies on the strong assumption that the label frequency (the fraction of identified taxpayers among tax evaders) is known in advance. Second, although some methods attempt to estimate label frequency using approaches like Mixture Proportion Estimation (MPE) without making any assumptions, they subsequently construct a classifier based on the error-prone label frequency obtained from the previous estimation. This two-stage approach may not be optimal, as it neglects error accumulation in classifier training resulting from the estimation bias in the first stage. To address these limitations, we propose a novel PU learning-based tax evasion detection framework called RR-PU, which can revise the bias in a two-stage synergistic manner. Specifically, RR-PU refines the label frequency initialization by leveraging a regrouping technique to fortify the MPE perspective. Subsequently, we integrate a trainable slack variable to fine-tune the initial label frequency, concurrently optimizing this variable and the classifier to eliminate latent bias in the initial stage. Experimental results on three real-world tax datasets demonstrate that RR-PU outperforms state-of-the-art methods in tax evasion detection tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RR-PU: A Synergistic Two-Stage Positive and Unlabeled Learning Framework for Robust Tax Evasion Detection

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

A Tax Evasion Detection Method Based on Positive and Unlabeled Learning with Network Embedding Features
Lingyun Mi ... Qinghua Zheng
-
Lingyun Mi, et. al.Lingyun Mi ... Qinghua Zheng
01 Jan 2020
01 Jan 2020

Positive and Unlabeled Learning with Label Disambiguation
Chuang Zhang ... Jian Yang
-
Chuang Zhang, et. al.Chuang Zhang ... Jian Yang
01 Aug 2019
01 Aug 2019

TTED-PU:A Transferable Tax Evasion Detection Method Based on Positive and Unlabeled Learning
Fa Zhang ... Qinghua Zheng
-
Fa Zhang, et. al.Fa Zhang ... Qinghua Zheng
01 Jul 2020
01 Jul 2020

Improving Neural Relation Extraction with Positive and Unlabeled Learning
Zhengqiu He ... Wenliang Chen
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Zhengqiu He, et. al.Zhengqiu He ... Wenliang Chen
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RR-PU: A Synergistic Two-Stage Positive and Unlabeled Learning Framework for Robust Tax Evasion Detection

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence