Simple Classification into Large Topic Ontology of Web Documents

Marko Grobelnik,Dunja Mladeni�

doi:10.2498/cit.2005.04.04

Simple Classification into Large Topic Ontology of Web Documents

Marko Grobelnik, Dunja Mladeni�

Open Access

https://doi.org/10.2498/cit.2005.04.04

Copy DOI

Journal: Journal of Computing and Information Technology	Publication Date: Jan 1, 2005
Citations: 19	License type: cc-by-nd

#Large Ontology #Web Documents + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology and providing it with enriched data by including additional information on the Web page context obtained from the link structure of the Web. The context is generated from the in-coming and out-going links of the Web document we want to classify (the target document), meaning that for representing a document we use, not only text of the document itself, but also the text from the documents pointing to the target document, as well as the text from the documents the target document is pointing to. The idea is that providing enriched data is compensating for the simplicity of the approach while keeping it efficient and capable of handling large topic ontology.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Journal of Computing and Information Technology

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.