Chinese short news text classification based on BERT and sparse autoencoder

Jiuzhou Lin

doi:10.1117/12.2659618

Abstract

Short news text classification plays an import role in natural language processing as the popularity of mobile phones. In this paper we propose a Chinese short news text classification method based on BERT and sparse autoencoder, regarding the overfitting caused by pretrained BERT. We use the BERT for text representation, the output vectors of BERT are dimension reduced through the sparse autoencoder, and then the Softmax classifier takes the reduced vectors as input to get the prediction of the input text. Experimental results show that our method mitigate the unbalance of the performance of different categories, raises the overall classification performance by six percentage, effectively alleviates the overfitting of text representation of BERT, and achieve a better Chinese short text classification performance than using naïve autoencoder and without autoencoder.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Chinese short news text classification based on BERT and sparse autoencoder

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Six-Granularity Based Chinese Short Text Classification
Xinjie Sun ... Xingying Huo
IEEE Access | VOL. 11
Xinjie Sun, et. al.Xinjie Sun ... Xingying Huo
01 Jan 2023
IEEE Access | VOL. 11

A Fine-grained Chinese Short Text Classification Method Based on Capsule Networks
Yangshuyi Xu ... Lin Zhang
Journal of Physics: Conference Series | VOL. 2555
Yangshuyi Xu, et. al.Yangshuyi Xu ... Lin Zhang
01 Jul 2023
Journal of Physics: Conference Series | VOL. 2555

Review of Chinese Short Text Classification
Fen Lin Wu ... Cheng Wang
Applied Mechanics and Materials | VOL. 336-338
Fen Lin Wu, et. al.Fen Lin Wu ... Cheng Wang
01 Jul 2013
Applied Mechanics and Materials | VOL. 336-338

Word-Level and Pinyin-Level Based Chinese Short Text Classification
Xinjie Sun ... Xingying Huo
IEEE Access | VOL. 10
Xinjie Sun, et. al.Xinjie Sun ... Xingying Huo
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Chinese short news text classification based on BERT and sparse autoencoder

Abstract

Talk to us

Similar Papers