Supervised Learning Methods for Bangla Web Document Categorization

Ashis Kumar Mandal, Rikta Sen

doi:10.5121/ijaia.2014.5508

Abstract

This paper explores the use of machine learning approaches, specifically four supervised learning methods, namely Decision Tree (C4.5), K-Nearest Neighbour (KNN), Naïve Bayes (NB), and Support Vector Machine (SVM), for the categorization of Bangla web documents. Categorization is the task of automatically sorting a set of documents into categories from a predefined set. Whereas a wide range of methods have been applied to English text categorization, relatively few studies have been conducted on Bangla text categorization. Hence, we attempt to analyze the efficiency of these four methods for the categorization of Bangla documents. To validate the methods, a Bangla corpus was developed from various websites and used as examples for the experiment. The empirical results indicate that all four methods perform satisfactorily on Bangla, with SVM attaining good results on high-dimensional and relatively noisy document feature vectors.
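
To make the categorization task concrete, the following is a minimal sketch of one of the four method families named above, a multinomial Naïve Bayes text classifier with add-one smoothing. It is not the authors' implementation. The function names `train_nb` and `predict_nb`, the whitespace tokenization, the two categories, and the tiny English training set are all illustrative assumptions. The paper's corpus is Bangla text collected from websites, and its preprocessing and feature settings are not given in the abstract.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs):
    """Count documents per class and words per class.

    docs is a list of (tokens, label) pairs. Hypothetical helper,
    not taken from the paper.
    """
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab

def predict_nb(model, tokens):
    """Return the label with the highest log-posterior score."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        # Log prior: fraction of training documents in this class.
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][tok] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training set (assumption: English stand-ins for a real corpus).
TRAIN = [
    ("team wins cricket match".split(), "sports"),
    ("batsman scores century in match".split(), "sports"),
    ("bank raises interest rate".split(), "business"),
    ("stock market and bank shares fall".split(), "business"),
]

model = train_nb(TRAIN)
print(predict_nb(model, "cricket match today".split()))
print(predict_nb(model, "bank interest".split()))
```

With this toy data the first query is assigned to `sports` and the second to `business`. A real comparison of the four methods would need a labelled corpus, a held-out test set, and a weighted feature representation such as TF-IDF, none of which this sketch attempts.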

Journal: International Journal of Artificial Intelligence & Applications
Publication Date: Sep 30, 2014
Citations: 89
