Abstract

Web document classification has become crucial as there has been a massive increase in the magnitude of web pages across the web. In the research community, an efficient approach to this problem is based on machine learning techniques. Ontology forms the heart of knowledge representation for any domain. This paper proposes an ontology-based term weighting technique which is novel and efficient for the classification of web pages. The proposed approach builds domain ontology and selects the features that significantly improve the prediction performance. Experiments were conducted on domain based web pages and classification performance was calculated with state of the art classification algorithms. The experimental analysis demonstrates that the proposed approach produces significantly better results compared to the traditional keyword-based approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.