A Method of Sustainable Development for Three Chinese Short-Text Datasets Based on BERT-CAM

Li Pan, Wei Hong Lim, Yong Gan

doi:10.3390/electronics12071531

Abstract

Current short text classification (TC) methods suffer from low accuracy and have difficulty predicting emotion effectively. To address this, a sustainable short TC (S-TC) method using deep learning (DL) in big data environments is proposed. First, the text is vectorized with a Bidirectional Encoder Representations from Transformers (BERT) pre-training model. BERT is trained by removing a word from the text and predicting it from the information in the preceding and following words, which improves TC accuracy on language tasks. Then, a convolutional attention mechanism (CAM) model is proposed, which uses a convolutional neural network (CNN) to capture feature interactions in the time dimension and multiple convolutional kernels to obtain more comprehensive feature information, further improving TC accuracy. Finally, by optimizing and merging the BERT pre-training model and the CAM model, a BERT-CAM classification model for S-TC is proposed. In simulation experiments, the proposed S-TC method is compared with three other methods on three datasets. The results show that the proposed method achieves the highest accuracy, precision, recall, F1 value, Ma_F and Mi_F, reaching 94.28%, 86.36%, 84.95%, 85.96%, 86.34% and 86.56%, respectively. The proposed method outperforms the three comparison algorithms.
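The abstract reports Ma_F and Mi_F without defining them. Assuming they denote the macro-averaged and micro-averaged F1 scores (a common convention, not confirmed by the text above), the difference can be sketched as follows. The function name `macro_micro_f1` and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def macro_micro_f1(y_true, y_pred):
    """Return (macro_f1, micro_f1) for single-label multi-class predictions.

    Assumption: Ma_F and Mi_F in the abstract mean macro- and micro-averaged F1.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # predicted p, but it was wrong
            fn[t] += 1          # true class t was missed

    def f1(tp_, fp_, fn_):
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    # Macro: average the per-class F1 scores, so every class counts equally.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in labels) / len(labels)
    # Micro: pool the counts over all classes, then compute a single F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    return macro, micro


# Toy sentiment labels, made up purely for illustration.
y_true = ["pos", "pos", "neg", "neu", "neg", "pos"]
y_pred = ["pos", "neg", "neg", "neu", "pos", "pos"]
macro, micro = macro_micro_f1(y_true, y_pred)
print(f"macro F1 = {macro:.4f}, micro F1 = {micro:.4f}")
```

Under this reading, macro F1 gives rare classes the same weight as frequent ones, while micro F1 is dominated by the frequent classes and, for single-label classification, equals plain accuracy.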

Journal: Electronics
Publication Date: Mar 24, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

Extremely Short Chinese Text Classification Method Based on Bidirectional Semantic Extension
Yongzeng Yue ... Yuhong Zhang
Journal of Physics: Conference Series | VOL. 1437
01 Jan 2020

Digital Library Information Integration System Based on Big Data and Deep Learning
Xiao Lin ... Ying Zhang
Journal of Sensors | VOL. 2022
01 Jul 2022

Short Text Classification Method Combining Word Vector and WTTM
Junwei Ge ... Hanxiao Wang
05 May 2020

Sentiment Analysis using a CNN-BiLSTM Deep Model Based on Attention Classification
Wang Yue ... Li Lei
Information | VOL. 26
15 Sep 2023
