Text Classification Techniques: A Literature Review

M Thangaraj,M Sivakami

doi:10.28945/4066

Abstract

Aim/Purpose: The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining. Background: Artificial Intelligence is reshaping text classification techniques to better acquire knowledge. However, in spite of the growth and spread of AI in all fields of research, its role with respect to text mining is not well understood yet. Methodology: For this study, various articles written between 2010 and 2017 on “text classification techniques in AI”, selected from leading journals of computer science, were analyzed. Each article was completely read. The research problems related to text classification techniques in the field of AI were identified and techniques were grouped according to the algorithms involved. These algorithms were divided based on the learning procedure used. Finally, the findings were plotted as a tree structure for visualizing the relationship between learning procedures and algorithms. Contribution: This paper identifies the strengths, limitations, and current research trends in text classification in an advanced field like AI. This knowledge is crucial for data scientists. They could utilize the findings of this study to devise customized data models. It also helps the industry to understand the operational efficiency of text mining techniques. It further contributes to reducing the cost of the projects and supports effective decision making. Findings: It has been found more important to study and understand the nature of data before proceeding into mining. The automation of text classification process is required, with the increasing amount of data and need for accuracy. Another interesting research opportunity lies in building intricate text data models with deep learning systems. It has the ability to execute complex Natural Language Processing (NLP) tasks with semantic requirements. Recommendations for Practitioners: Frame analysis, deception detection, narrative science where data expresses a story, healthcare applications to diagnose illnesses and conversation analysis are some of the recommendations suggested for practitioners. Recommendation for Researchers: Developing simpler algorithms in terms of coding and implementation, better approaches for knowledge distillation, multilingual text refining, domain knowledge integration, subjectivity detection, and contrastive viewpoint summarization are some of the areas that could be explored by researchers. Impact on Society: Text classification forms the base of data analytics and acts as the engine behind knowledge discovery. It supports state-of-the-art decision making, for example, predicting an event before it actually occurs, classifying a transaction as ‘Fraudulent’ etc. The results of this study could be used for developing applications dedicated to assisting decision making processes. These informed decisions will help to optimize resources and maximize benefits to the mankind. Future Research: In the future, better methods for parameter optimization will be identified by selecting better parameters that reflects effective knowledge discovery. The role of streaming data processing is still rarely explored when it comes to text classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Interdisciplinary Journal of Information, Knowledge, and Management	Publication Date: Jan 1, 2018
Citations: 67	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Text Classification Techniques: A Literature Review

Abstract

Talk to us

Similar Papers

More From: Interdisciplinary Journal of Information, Knowledge, and Management

Lead the way for us

Similar Papers

A Theoretical Study on Advanced Techniques in Pre-processing and Text Classification
...
International Journal of Data Mining And Emerging Technologies | VOL. 5
, et. al. ...
01 Jan 2015
International Journal of Data Mining And Emerging Technologies | VOL. 5

Using text classification and multiple concepts to answer e-mails
Sung-Shun Weng ... Chih-Kai Liu
Expert Systems with Applications | VOL. 26
Sung-Shun Weng, et. al.Sung-Shun Weng ... Chih-Kai Liu
26 Nov 2003
Expert Systems with Applications | VOL. 26

Enhancing accident cause analysis through text classification and accident causation theory: A case study of coal mine gas explosion accidents
Qingsong Jia ... Shihan Hu
Process Safety and Environmental Protection | VOL. 185
Qingsong Jia, et. al.Qingsong Jia ... Shihan Hu
19 Mar 2024
Process Safety and Environmental Protection | VOL. 185

A Review of a Text Classification Technique: K-Nearest Neighbor
R.S Zhou ... Z.J Wang
-
R.S Zhou, et. al.R.S Zhou ... Z.J Wang
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text Classification Techniques: A Literature Review

Abstract

Talk to us

Similar Papers

More From: Interdisciplinary Journal of Information, Knowledge, and Management