Abstract

Text classification is one domain in which the naive Bayesian (NB) learning algorithm performs remarkably well. However, further improving its performance with ensemble-building techniques has proven to be a challenge because NB is a stable algorithm. This work shows that, while an ensemble of NB classifiers achieves little or no improvement in classification accuracy, an ensemble of fine-tuned NB classifiers can achieve a remarkable improvement. We propose a fine-tuning algorithm for text classification that is both more accurate and less stable than the NB algorithm and the fine-tuning NB (FTNB) algorithm. These properties make it more suitable than the FTNB algorithm for building ensembles of classifiers using bagging. Our empirical experiments, using 16 benchmark text-classification data sets, show significant improvements in accuracy for most data sets.
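
To make the fine-tuning idea concrete, the sketch below shows a generic error-driven fine-tuning loop for a multinomial NB text classifier: after standard training, the class-conditional term probabilities are nudged up for the true class and down for the wrongly predicted class of each misclassified training document. This is only an illustration of the general idea; the exact update rules of FTNB [14] and of the gradual variant proposed here differ, and the helper names and constants are assumptions made for the example.

```python
# Illustrative sketch only: a generic error-driven fine-tuning loop for a
# multinomial NB text classifier. The concrete update rules of FTNB [14]
# and the gradual variant differ; eta, max_iter, and all helpers here are
# assumptions made for the example.
import numpy as np

def train_nb(docs, labels, vocab, classes, alpha=1.0):
    """Standard multinomial NB with Laplace smoothing."""
    priors = np.array([np.mean([y == c for y in labels]) for c in classes])
    cond = np.full((len(classes), len(vocab)), alpha)
    t_index = {t: i for i, t in enumerate(vocab)}
    for doc, y in zip(docs, labels):
        ci = classes.index(y)
        for t in doc:
            if t in t_index:
                cond[ci, t_index[t]] += 1
    cond /= cond.sum(axis=1, keepdims=True)          # P(term | class)
    return np.log(priors), cond, t_index

def predict(doc, log_priors, cond, t_index):
    idx = [t_index[t] for t in doc if t in t_index]
    return int(np.argmax(log_priors + np.log(cond[:, idx]).sum(axis=1)))

def fine_tune(docs, labels, log_priors, cond, t_index, classes,
              eta=0.01, max_iter=20):
    """Nudge P(term | class) for the terms of each misclassified training
    document: up for the true class, down for the predicted class, then
    renormalise the affected rows."""
    for _ in range(max_iter):
        errors = 0
        for doc, y in zip(docs, labels):
            yi = classes.index(y)
            pi = predict(doc, log_priors, cond, t_index)
            if pi != yi:
                errors += 1
                for t in doc:
                    if t in t_index:
                        cond[yi, t_index[t]] *= (1 + eta)
                        cond[pi, t_index[t]] *= (1 - eta)
                cond[yi] /= cond[yi].sum()
                cond[pi] /= cond[pi].sum()
        if errors == 0:                               # training set fitted
            break
    return log_priors, cond
```

Because the updates depend on exactly which training documents are misclassified, a small change in the training sample can lead to a noticeably different fine-tuned classifier, which is the source of the reduced stability that bagging exploits.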

Highlights

  • In text classification, the task is to assign a document to one category from a predefined set of categories

  • The related-work section is divided into two subsections: the first reviews work on ensembles of classifiers in general, and on building ensembles of naive Bayesian (NB) classifiers in particular; the second reviews the FTNB algorithm [14] for fine-tuning NB classifiers

  • Our results showed that the Gradual FTNB (GFTNB) algorithm outperformed the FTNB algorithm in terms of the average classification accuracy for the 16 text-classification data sets, and in terms of the number of data sets for which it achieved better and significantly better average accuracy

Summary

Introduction

In text classification, the task is to assign a document to one category from a predefined set of categories. Bagging [5] and boosting [6,7] are probably the most widely used methods for building ensembles of classifiers; they train the constituent classifiers using different samples of the training data. Making further improvements by building an ensemble of several NB classifiers is a challenge because NB is a stable algorithm [12], in the sense that a small change in the training data does not lead to a substantially different classifier. We use the fine-tuning method to generate a diverse ensemble of NB classifiers for text classification, as sketched below.
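
As a concrete illustration of this bagging setup (not the authors' implementation), the following sketch builds an ensemble of multinomial NB text classifiers with scikit-learn, where each constituent classifier is trained on a bootstrap sample of the training documents. The toy documents, labels, and parameter values are assumptions for the example, and the fine-tuning step is omitted.

```python
# Minimal sketch of bagging NB text classifiers with scikit-learn.
# This illustrates the generic bagging setup only; it does not include the
# fine-tuning step the paper uses to make the base classifiers less stable.
# All data and parameter values below are made up for the example.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.ensemble import BaggingClassifier

# Toy training corpus (hypothetical documents and category labels).
docs = ["stocks fall on weak earnings",
        "team wins the championship final",
        "markets rally after rate cut",
        "coach praises the young squad"]
labels = ["business", "sports", "business", "sports"]

# Bag-of-words features, as is typical for NB text classification.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(docs)

# Each constituent NB classifier is trained on a different bootstrap sample
# of the training data; the ensemble aggregates their predictions.
ensemble = BaggingClassifier(MultinomialNB(), n_estimators=10, random_state=0)
ensemble.fit(X, labels)

print(ensemble.predict(vectorizer.transform(["markets fall again"])))
```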

Related Work
Building Ensembles of Classifiers
Fine-Tuning the NB Algorithm
Bagging NB and the Fine-Tuning Algorithms
Bagging the NB and FTNB Algorithms for Text Classification
Building
Comparing
Modifying the Termination Condition
Findings
Conclusions