Accurate Prediction of Bangla Text Article Categorization by Utilizing Novel Bangla Stemmer

Mahtab Uddin

doi:10.5875/xbzrk013

Abstract

Text categorization involves assigning predefined category labels to an unlabeled document. With the exponential growth in the accessibility and availability of digital documents over the past decade, this field significantly attracted the scientific community that immensely demands rapid and accurate categorization of these documents. Relying on experts for manual classification is time-consuming and resource-intensive. Consequently, labeling unlabeled digital documents faster more accurately, and more efficiently is inescapable. One promising approach to addressing this demand is the use of machine learning algorithms. Training these algorithms on a large dataset of labeled texts lets them learn patterns and predicted unlabeled documents. This strategy might greatly expedite the categorizing process while retaining a substantial level of accuracy through leveraging artificial intelligence. These algorithms have also enhanced natural language processing techniques, making them more accurate at classifying unlabeled digital documents. In this study, we propose a novel machine-learning computational framework to address this challenge. Our framework incorporates a novel Bangla stemmer, which reduces words to their stems. We then employed TF-IDF for document vectorization, a statistical measure assessing word relevance for categorization purposes. Experimental results reveal that our framework significantly enhances prediction performance, achieving an impressive 95.3% prediction accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Accurate Prediction of Bangla Text Article Categorization by Utilizing Novel Bangla Stemmer

Abstract

Talk to us

Similar Papers

More From: International Journal of Automation and Smart Technology

Lead the way for us

Journal: International Journal of Automation and Smart Technology	Publication Date: Oct 10, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

The use of machine learning algorithms in recommender systems: A systematic review
Ivens Portugal ... Donald Cowan
Expert Systems with Applications | VOL. 97
Ivens Portugal, et. al.Ivens Portugal ... Donald Cowan
09 Dec 2017
Expert Systems with Applications | VOL. 97

Untersuchungen über die photosynthetische Leistung gelbblättriger Gehölze )
Klaus Michael
Flora oder Allgemeine Botanische Zeitung | VOL. 141
Klaus MichaelKlaus Michael
01 Jan 1953
Flora oder Allgemeine Botanische Zeitung | VOL. 141

Evaluation of machine learning algorithms for fast video transcoding in streaming services
Thiago Bubolz ... Guilherme Correa
-
Thiago Bubolz, et. al.Thiago Bubolz ... Guilherme Correa
29 Oct 2019
29 Oct 2019

Progress towards machine learning reaction rate constants.
Evan Komp ... Nida Janulaitis
Physical Chemistry Chemical Physics | VOL. 24
Evan Komp, et. al.Evan Komp ... Nida Janulaitis
01 Jan 2021
Physical Chemistry Chemical Physics | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accurate Prediction of Bangla Text Article Categorization by Utilizing Novel Bangla Stemmer

Abstract

Talk to us

Similar Papers

More From: International Journal of Automation and Smart Technology