Abstract
To address the tendency of neural-network text classification models to overfit during training and to overlook key words in sentences, a Bi-GRU Chinese text classification model based on a hierarchical attention mechanism is proposed. The model adopts a hierarchical design: a bidirectional gated recurrent unit (Bi-GRU) network learns text representations at both the word level and the sentence level, and a hierarchical self-attention mechanism captures the influence of individual words and sentences on the classification decision. The weights of the embedding layer and the softmax layer are tied to reduce the number of model parameters, and the AMSBound optimization method is used to obtain an effective weight matrix quickly. Experiments are conducted on two commonly used Chinese datasets, FudanSet and THUCNews; on the long-text classification dataset FudanSet, the model outperforms the Text-CNN, Attention-BiLSTM, and Bi-GRU_CNN models, improving accuracy, recall, and F-score by 5.9%, 5.8%, and 4.6%, respectively.
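The two-level attention pooling described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the Bi-GRU encoders are replaced by random stand-in hidden states, and `attention_pool` with a learnable context vector is one common form of self-attention pooling; the array shapes and the context vectors `w_word` and `w_sent` are assumptions for the sketch.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_pool(H, w):
    # H: (steps, dim) hidden states from an encoder (e.g. a Bi-GRU);
    # w: (dim,) learnable context vector scoring each step's importance.
    scores = softmax(np.tanh(H) @ w)   # attention weights, sum to 1
    return scores @ H                  # weighted sum -> (dim,) summary vector

rng = np.random.default_rng(0)
dim = 8
w_word = rng.standard_normal(dim)      # word-level context vector (assumed)
w_sent = rng.standard_normal(dim)      # sentence-level context vector (assumed)

# A document of 3 sentences x 5 words; stand-ins for Bi-GRU outputs.
doc = rng.standard_normal((3, 5, dim))

# Word level: pool each sentence's word states into a sentence vector.
sent_vecs = np.stack([attention_pool(H, w_word) for H in doc])
# Sentence level: pool sentence vectors into a document vector,
# which would then feed the softmax classification layer.
doc_vec = attention_pool(sent_vecs, w_sent)
print(doc_vec.shape)
```

The hierarchy mirrors the model's structure: attention weights at the word level indicate which words matter within a sentence, and weights at the sentence level indicate which sentences matter for the document's class.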