Twenty Years of Machine-Learning-Based Text Classification: A Systematic Review

Ashokkumar Palanivinayagam,Robertas Damaševičius,Claude Ziad El-Bayeh

doi:10.3390/a16050236

Abstract

Machine-learning-based text classification is one of the leading research areas and has a wide range of applications, which include spam detection, hate speech identification, reviews, rating summarization, sentiment analysis, and topic modelling. Widely used machine-learning-based research differs in terms of the datasets, training methods, performance evaluation, and comparison methods used. In this paper, we surveyed 224 papers published between 2003 and 2022 that employed machine learning for text classification. The Preferred Reporting Items for Systematic Reviews (PRISMA) statement is used as the guidelines for the systematic review process. The comprehensive differences in the literature are analyzed in terms of six aspects: datasets, machine learning models, best accuracy, performance evaluation metrics, training and testing splitting methods, and comparisons among machine learning models. Furthermore, we highlight the limitations and research gaps in the literature. Although the research works included in the survey perform well in terms of text classification, improvement is required in many areas. We believe that this survey paper will be useful for researchers in the field of text classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms	Publication Date: Apr 29, 2023
Citations: 16	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Twenty Years of Machine-Learning-Based Text Classification: A Systematic Review

Abstract

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Similar Papers

An Optimized E-Lecture Video Retrieval based on Machine Learning Classification
Lakshmi Haritha Medida* ... Kasarapu Ramani
International Journal of Engineering and Advanced Technology | VOL. 8
Lakshmi Haritha Medida*, et. al.Lakshmi Haritha Medida* ... Kasarapu Ramani
30 Aug 2019
International Journal of Engineering and Advanced Technology | VOL. 8

Software defect prediction: A multi-criteria decision-making approach
A.O Balogun ... H.A Mojeed
Nigerian Journal of Technological Research | VOL. 15
A.O Balogun, et. al.A.O Balogun ... H.A Mojeed
30 Apr 2020
Nigerian Journal of Technological Research | VOL. 15

Classification of Persian News Articles using Machine Learning Techniques
...
-
, et. al. ...
17 May 2021
17 May 2021

Information-Theoretic Bounds on Quantum Advantage in Machine Learning.
Hsin-Yuan Huang ... John Preskill
Physical Review Letters | VOL. 126
Hsin-Yuan Huang, et. al.Hsin-Yuan Huang ... John Preskill
14 May 2021
Physical Review Letters | VOL. 126

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Twenty Years of Machine-Learning-Based Text Classification: A Systematic Review

Abstract

Talk to us

Similar Papers

More From: Algorithms