Abstract

In machine translation, the attention mechanism dynamically highlights relevant words according to the distances between source and target vectors. However, its ability to optimize text classification is limited because it only calculates weights within a single text. Words are unevenly distributed between the ham (non-spam) and spam categories, and this category-level feature has not previously been exploited by attention mechanisms for filtering short texts. In addition, short text filtering is uniquely challenging due to the brevity, sparsity, and informal style of the texts, as well as the need for rapid processing. We propose a novel category-level attention mechanism called "category-learning attention," which highlights words concentrated in one category by dynamically calculating a category differentiation matrix for each short text. The category-learning attention mechanism is extended to category-learning scaled dot-product attention and category-learning multi-head attention (CL-MHA). The CL-MHA mechanism is then applied to a bidirectional gated recurrent unit (Bi-GRU) model and evaluated on the SMS Spam Collection dataset hosted at the University of California, Irvine. Performance metrics, including accuracy, precision, recall, and F1 score, demonstrate that the CL-MHA mechanism significantly improves the performance of Bi-GRU for short text filtering, reaching an accuracy of 99.35%, higher than that of any previously reported machine learning model. In addition, experiments conducted on three further datasets (a Chinese SMS spam dataset, a benchmark movie review dataset, and a benchmark customer review dataset) validate the effectiveness of the proposed model, with the CL-MHA Bi-GRU model achieving an accuracy of 99.46% on the Chinese SMS spam dataset.
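
The abstract does not specify the exact form of the category differentiation matrix, so the following is only a minimal PyTorch sketch of the architecture it outlines: per-token category scores, assumed here to be precomputed from how unevenly each word is distributed between ham and spam in the training data, additively bias the multi-head attention logits over a Bi-GRU encoding. The class names (`CLMultiHeadAttention`, `CLMHABiGRU`), the additive-bias formulation, and all hyperparameters are illustrative assumptions, not the paper's definitions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CLMultiHeadAttention(nn.Module):
    """Multi-head self-attention whose logits are biased by a per-token
    category-differentiation score (an assumed reading of CL-MHA)."""
    def __init__(self, d_model, n_heads):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x, cat_score):
        # x: (B, T, d_model); cat_score: (B, T) in [0, 1], where higher
        # means the word is more concentrated in one category (e.g. spam).
        B, T, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        shape = (B, T, self.n_heads, self.d_head)
        q = q.view(shape).transpose(1, 2)            # (B, H, T, d_head)
        k = k.view(shape).transpose(1, 2)
        v = v.view(shape).transpose(1, 2)
        logits = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        # Additive bias on the key axis: tokens unevenly distributed
        # across categories attract more attention (assumption).
        logits = logits + cat_score[:, None, None, :]
        attn = F.softmax(logits, dim=-1)
        ctx = (attn @ v).transpose(1, 2).reshape(B, T, -1)
        return self.out(ctx)

class CLMHABiGRU(nn.Module):
    """Bi-GRU encoder followed by category-learning multi-head attention
    and a binary (ham/spam) classifier head."""
    def __init__(self, vocab_size, d_emb=128, d_hidden=64, n_heads=4):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, d_emb, padding_idx=0)
        self.bigru = nn.GRU(d_emb, d_hidden, batch_first=True,
                            bidirectional=True)
        self.attn = CLMultiHeadAttention(2 * d_hidden, n_heads)
        self.cls = nn.Linear(2 * d_hidden, 2)

    def forward(self, tokens, cat_score):
        h, _ = self.bigru(self.emb(tokens))          # (B, T, 2*d_hidden)
        h = self.attn(h, cat_score)
        return self.cls(h.mean(dim=1))               # mean-pool, classify

if __name__ == "__main__":
    # Illustrative use: cat_score would be derived offline, e.g. from the
    # absolute difference of a token's relative frequency in spam vs. ham.
    model = CLMHABiGRU(vocab_size=10_000)
    tokens = torch.randint(1, 10_000, (8, 40))       # 8 texts, 40 tokens
    cat_score = torch.rand(8, 40)                    # placeholder scores
    print(model(tokens, cat_score).shape)            # (8, 2) logits
```

One design choice worth noting in this sketch: the category score enters as an additive logit bias rather than a multiplicative reweighting, which keeps the softmax normalization intact while still steering attention toward category-discriminative words.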
