A Feasible Chinese Text Data Preprocessing Strategy

Jingang Liu,Haihua Yan,Chunhe Xia,Jie Sun

doi:10.1109/uemcon51285.2020.9298131

Abstract

With the rapid rise of artificial intelligence technologies such as machine learning and the rapid development of the big data industry, more and more attention is paid to the use of data itself, especially the Chinese text data, which is more complex in expression and richer in the information. It is a necessary step to process the raw Chinese text data before it is used for specific application tasks. However, the current strategies for processing data are generally to deal with data in different fields and specific application tasks. In this paper, to further improve the quality of Chinese data processing and give play to the application value of Chinese data, we propose a general and feasible Chinese text preprocessing strategy, named the multi-level data preprocessing strategy (MLDPS). This strategy uses four effective links to process raw Chinese text data systematically. We believe that the proposed MLDPS has relatively strong practical significance, and provides a better idea for preprocessing Chinese text data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Feasible Chinese Text Data Preprocessing Strategy

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

On semi-automated extraction of causal networks from raw text
Solat J Sheikh ... Alexander H Levis
Engineering Applications of Artificial Intelligence | VOL. 123
Solat J Sheikh, et. al.Solat J Sheikh ... Alexander H Levis
29 Mar 2023
Engineering Applications of Artificial Intelligence | VOL. 123

How Much Can Machines Learn Finance From Chinese Text Data?
Jianqing Fan ... Lirong Xue
SSRN Electronic Journal | VOL. -
Jianqing Fan, et. al.Jianqing Fan ... Lirong Xue
01 Jan 2020
SSRN Electronic Journal | VOL. -

Domain ontology graph model and its application in Chinese text classification
James N K Liu ... Edward H Y Lim
Neural Computing and Applications | VOL. 24
James N K Liu, et. al.James N K Liu ... Edward H Y Lim
19 Dec 2012
Neural Computing and Applications | VOL. 24

Research on Classification of Chinese Text Data Based on SVM
Yuan Lin ... Tao Xu
IOP Conference Series: Materials Science and Engineering | VOL. 231
Yuan Lin, et. al.Yuan Lin ... Tao Xu
01 Sep 2017
IOP Conference Series: Materials Science and Engineering | VOL. 231

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Feasible Chinese Text Data Preprocessing Strategy

Abstract

Talk to us

Similar Papers