Application Research of Text Classification Based on Random Forest Algorithm

Yeli Li,Yanxiong Sun,Yuning Bian,Qingtao Zeng

doi:10.1109/aemcse50948.2020.00086

Application Research of Text Classification Based on Random Forest Algorithm

Yeli Li, Yanxiong Sun + Show 2 more

https://doi.org/10.1109/aemcse50948.2020.00086

Copy DOI

Publication Date: Apr 1, 2020

Citations: 16

Affiliation: Beijing Institute of Graphic Communication

#Random Forest Algorithm #Original Random Forest Algorithm + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In view of the poor classification effect of traditional random forest algorithm due to the low quality of text feature extraction, a random forest method for text information is proposed. In view of the difficulty in controlling the quality of traditional random forest decision trees, a weighted voting mechanism is proposed to improve the quality of decision trees. This algorithm uses tr-k method based on text feature extraction to improve the quality and diversity of text features, and uses the latest Bert word vector generation model to represent the text. Experimental data in Python environment show that this method can achieve better results in text classification than IDF based random forest algorithm and original random forest algorithm.

Full Text