Abstract

A news document often related to more than one category, necessary for utilization the method of categorization that is not only fast but also able to classify a news into many categories. Many methods can be used to categorize the news documents, one of which is an ontology. Ontology approach in the categorization of a document is based on the similarity of news features in documents with features that exist in the ontology. The use of ontologies in categorization that just based on the occurance of the term in calculating the relevance of the document, led to the emergence of many other features that are actually very relevant is undetectable. This paper proposed a new method for categorizing news documents are related with many categories, the method is based on a specific domain ontology and for document relevance calculation is not only based on the occurrence of the term but also take into account the relationships between terms that are formed. Tests performed on the Indonesian language news document with two categories: sports and technology. The trial results show the value of the average accuracy is high, that the sports category was 93,85% and the technology category is 96,32%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.