Multimodal fraudulent website identification method based on heterogeneous model ensemble

Shengli Zhou,Qingyang Xu,Mincheng Chen,Linqi Ruan

doi:10.23919/jcc.fa.2022-0234.202305

Abstract

The feature analysis of fraudulent websites is of great significance to the combat, prevention and control of telecom fraud crimes. Aiming to address the shortcomings of existing analytical approaches, i.e. single dimension and venerability to anti-reconnaissance, this paper adopts the Stacking, the ensemble learning algorithm, combines multiple modalities such as text, image and URL, and proposes a multimodal fraudulent website identification method by ensembling heterogeneous models. Cross-validation is first used in the training of multiple largely different base classifiers that are strong in learning, such as BERT model, residual neural network (ResNet) and logistic regression model. Classification of the text, image and URL features are then performed respectively. The results of the base classifiers are taken as the input of the meta-classifier, and the output of which is eventually used as the final identification. The study indicates that the fusion method is more effective in identifying fraudulent websites than the single-modal method, and the recall is increased by at least 1%. In addition, the deployment of the algorithm to the real Internet environment shows the improvement of the identification accuracy by at least 1.9% compared with other fusion methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal fraudulent website identification method based on heterogeneous model ensemble

Abstract

Talk to us

Similar Papers

More From: China Communications

Lead the way for us

Journal: China Communications	Publication Date: May 1, 2023
Citations: 2

Similar Papers

Deep limits of residual neural networks
Matthew Thorpe ... Yves Van Gennip
Research in the Mathematical Sciences | VOL. 10
Matthew Thorpe, et. al.Matthew Thorpe ... Yves Van Gennip
16 Dec 2022
Research in the Mathematical Sciences | VOL. 10

Multi-Type Object Tracking Based on Residual Neural Network Model
Tao Jiang ... Chen Li
Symmetry | VOL. 14
Tao Jiang, et. al.Tao Jiang ... Chen Li
15 Aug 2022
Symmetry | VOL. 14

Fault diagnosis of wind turbine gearbox based on residual neural network
Z.-W Duan ... G.-J Jiang
-
Z.-W Duan, et. al.Z.-W Duan ... G.-J Jiang
01 Jan 2021
01 Jan 2021

Human activity classification based on sound recognition and residual convolutional neural network
Minhyuk Jung ... Seokho Chi
Automation in Construction | VOL. 114
Minhyuk Jung, et. al.Minhyuk Jung ... Seokho Chi
20 Mar 2020
Automation in Construction | VOL. 114

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal fraudulent website identification method based on heterogeneous model ensemble

Abstract

Talk to us

Similar Papers

More From: China Communications