Abstract

Knowledge contained in natural language data can be used to build expert systems. One way to approach the problem of building intelligent systems is to classify unstructured data using an ontology together with text classification algorithms in order to extract information. A major problem in text processing is that most generated data is unstructured and ambiguous, whereas structured data helps to identify meaningful patterns and eventually expose latent knowledge. Ambiguity in natural language reduces the accuracy of categorization. Combining Natural Language Processing techniques with semantic data modeling through ontological knowledge also addresses the problem of domain knowledge representation, thereby enabling improved data classification, particularly in large datasets where the number of features scales to unmanageable proportions. In this paper, the domain knowledge is presented as a knowledge graph derived from the semantic data modeling. To achieve better multi-class classification, the Multinomial Naive Bayes algorithm is then applied to assign items to their respective classes. For the experiments, data from various newsgroups was used to test the accuracy of the model. The experimental results show that the proposed classifier performs better than existing systems.
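
To make the classification step concrete, the following is a minimal sketch of Multinomial Naive Bayes with Laplace smoothing over bag-of-words counts. It is not the paper's implementation: the toy documents, the class labels, the whitespace tokenizer, and the function names `train_multinomial_nb` and `predict` are all assumptions made for illustration, and the ontology and knowledge-graph stage described above is not modeled here.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, labels, alpha=1.0):
    """Collect class priors and per-class word counts (hypothetical helper)."""
    word_counts = defaultdict(Counter)  # class -> token -> count
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = text.lower().split()   # naive whitespace tokenizer (assumption)
        word_counts[label].update(tokens)
        vocab.update(tokens)
    class_sizes = Counter(labels)
    return {
        "alpha": alpha,
        "vocab": vocab,
        "word_counts": word_counts,
        # log P(class), estimated from document frequencies
        "log_prior": {c: math.log(n / len(docs)) for c, n in class_sizes.items()},
        # total number of tokens observed in each class
        "total": {c: sum(word_counts[c].values()) for c in class_sizes},
    }


def predict(model, text):
    """Return the class with the highest log-posterior score."""
    # Tokens never seen in training are ignored.
    tokens = [t for t in text.lower().split() if t in model["vocab"]]
    alpha, vocab_size = model["alpha"], len(model["vocab"])
    best_label, best_score = None, -math.inf
    for label, log_prior in model["log_prior"].items():
        score = log_prior
        denom = model["total"][label] + alpha * vocab_size
        for t in tokens:
            # Laplace-smoothed estimate of P(token | class)
            score += math.log((model["word_counts"][label][t] + alpha) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy corpus standing in for newsgroup posts (illustrative data only).
docs = [
    "the team won the hockey game",
    "the pitcher threw a fast ball",
    "the rocket reached orbit",
    "nasa launched a new satellite",
    "the processor runs the graphics driver",
    "install the video card driver",
]
labels = ["sport", "sport", "space", "space", "hardware", "hardware"]

model = train_multinomial_nb(docs, labels)
print(predict(model, "hockey team game"))
print(predict(model, "satellite orbit"))
```

Scores are accumulated in log space so that multiplying many small probabilities does not underflow, and the smoothing term `alpha` keeps a single unseen word from driving a class probability to zero. In a pipeline like the one described above, the word counts would presumably be replaced or enriched by features derived from the knowledge graph.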
