Feature weighted naïve Bayes algorithm for information retrieval of enterprise systems

Li Wang,Ping Ji,Jing Qi,Siqing Shan,Zhuming Bi,Weiguo Deng,Naijing Zhang

doi:10.1080/17517575.2013.860481

Abstract

Automated information retrieval is critical for enterprise information systems to acquire knowledge from the vast amount of data sets. One challenge in information retrieval is text classification. Current practices rely heavily on the classical naïve Bayes algorithm due to its simplicity and robustness. However, results from this algorithm are not always satisfactory. In this article, the limitations of the naïve Bayes algorithm are discussed, and it is found that the assumption on the independence of terms is the main reason for an unsatisfactory classification in many real-world applications. To overcome the limitations, the dependent factors are considered by integrating a term frequency–inverse document frequency (TF-IDF) weighting algorithm in the naïve Bayes classification. Moreover, the TF-IDF algorithm itself is improved so that both frequencies and distribution information are taken into consideration. To illustrate the effectiveness of the proposed method, two simulation experiments were conducted, and the comparisons with other classification methods have shown that the proposed method has outperformed other existing algorithms in terms of precision and index recall rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Feature weighted naïve Bayes algorithm for information retrieval of enterprise systems

Abstract

Talk to us

Similar Papers

More From: Enterprise Information Systems

Lead the way for us

Journal: Enterprise Information Systems	Publication Date: Dec 17, 2013
Citations: 6

Similar Papers

Detection of SQL Injection Attacks Based on Improved TFIDF Algorithm
Yingbo Li ... Bin Zhang
Journal of Physics: Conference Series | VOL. 1395
Yingbo Li, et. al.Yingbo Li ... Bin Zhang
01 Nov 2019
Journal of Physics: Conference Series | VOL. 1395

Text Features Extraction based on TF-IDF Associating Semantic
Qing Liu ... Naiyao Wang
-
Qing Liu, et. al.Qing Liu ... Naiyao Wang
01 Dec 2018
01 Dec 2018

Tourism Destination Article Search Features using TF-IDF and Cosine similarity
Mizanul Ridho Aohana ... Fitri Bimantoro
DIELEKTRIKA | VOL. 10
Mizanul Ridho Aohana, et. al.Mizanul Ridho Aohana ... Fitri Bimantoro
31 Aug 2023
DIELEKTRIKA | VOL. 10

Analysis of whether news on the Internet is real or fake by using deep learning methods and the TF-IDF algorithm
Tilbe Korkmaz ... Mehmet Ali Barişkan
International Advanced Researches and Engineering Journal | VOL. 5
Tilbe Korkmaz, et. al.Tilbe Korkmaz ... Mehmet Ali Barişkan
15 Apr 2021
International Advanced Researches and Engineering Journal | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature weighted naïve Bayes algorithm for information retrieval of enterprise systems

Abstract

Talk to us

Similar Papers

More From: Enterprise Information Systems