A classification and extraction method of attribute hybrid big data based on Naive Bayes algorithm

Liantian Li,Ling Yang

doi:10.3233/jcm-226802

Abstract

In the identification of network text information, the existing technology is difficult to accurately extract and classify text information with high propagation speed and high update speed. In order to solve this problem, the research combines the Naive Bayes algorithm with the feature two-dimensional information gain weighting method, uses the feature weighting method to optimize the Naive Bayes algorithm, and calculates the dimension of different documents and data categories through a new feature operation method. The data gain between them can improve its classification performance, and the classification models are compared and analyzed in the actual Chinese and English databases. The research results show that the classification accuracy rates of the IGDC-DWNB model in the Sogou database, 20-newsgroup database, Fudan database and Ruster21578 database are 0.89, 0.89, 0.93, and 0.88, respectively, which are higher than other classification models in the same environment. It can be seen that the model designed in the research has higher classification accuracy, stronger overall performance, and stronger reliability and robustness in practical applications, which can provide a new development idea for big data classification technology.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A classification and extraction method of attribute hybrid big data based on Naive Bayes algorithm

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Methods in Sciences and Engineering

Lead the way for us

Similar Papers

Classification of Alzheimer's disease progression based on sMRI using gray matter volume and lateralization index.
Qian Zhang ... Xiaoli Yang
PloS one | VOL. 17
Qian Zhang, et. al.Qian Zhang ... Xiaoli Yang
30 Mar 2022
PloS one | VOL. 17

Study on the Classification Method of Rice Leaf Blast Levels Based on Fusion Features and Adaptive-Weight Immune Particle Swarm Optimization Extreme Learning Machine Algorithm.
Dongxue Zhao ... Jinpeng Li
Frontiers in Plant Science | VOL. 13
Dongxue Zhao, et. al.Dongxue Zhao ... Jinpeng Li
06 May 2022
Frontiers in Plant Science | VOL. 13

Analysis of Naive Bayesian and Back Propagation algorithms in iris classification
Chengyang Yu
Applied and Computational Engineering | VOL. 37
Chengyang YuChengyang Yu
07 Feb 2024
Applied and Computational Engineering | VOL. 37

Machine Learning-Based Intelligent Scoring of College English Teaching in the Field of Natural Language Processing
Wei Wang
Computational Intelligence and Neuroscience | VOL. 2022
Wei WangWei Wang
04 Aug 2022
Computational Intelligence and Neuroscience | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A classification and extraction method of attribute hybrid big data based on Naive Bayes algorithm

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Methods in Sciences and Engineering