Abstract

Automatic text classification has gained huge popularity with the advancement of information technology. Bayesian method has been found highly appropriate for text classification but it suffers from a number of problems. When there is large number of categories, lack of uniformity in training data becomes a big problem. Some nodes may get less training documents, while other may get a very large number. Therefore, some nodes are biased over others. Besides, presence of noise data or outliers also creates problems. Moreover, when documents are very small, just like a line item describing a product, the problem becomes more difficult. In this paper we describe a method that combines naive Bayesian text classification technique and neural networks to handle these problems. We start with a naive Bayesian classifier, which has the linear separating surfaces. We modify the separating surfaces using neural network to find better separating surfaces and hence better classification accuracy over validation data.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.