Abstract

It is a very important task that how to classify Web pages automatically and effectively in accordance with the given model for machine learning. The traditional operation modes, including artificial way and semiautomatic way, form category abstracts after domain experts' personnel inspection and then put the results into a particular class library according to the scheduled requirements. An improved naive Bayesian Web text classification algorithm is proposed in this paper. The common Bayesian classifier assumes that all the items are equally important while in this paper the terms in each title are considered to be more important than others. Experiments showed that, the improved naive Bayesian algorithm is more precise in the text classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.