Ontology Based Structured Representation for Domain Specific Unstructured Documents

H.L Shashirekha, S Murali

doi:10.1109/iccima.2007.255

Publication Date: Dec 1, 2007

Citations: 4

#Domain Dependent Ontology #Unstructured Text

Abstract

Extracting information from brief, unstructured text is challenging when the text consists of short phrases, incomplete sentences, unordered word sequences, and abbreviated words that follow no regular syntax. This paper describes an approach that automatically extracts information from data-rich unstructured text documents using a domain-dependent ontology and populates a database with the results. We apply pattern matching on keywords and constants to extract the patterns and generate a structured text representation with respect to the domain-specific ontology. The approach is illustrated on one such short, unstructured text type, the classified matrimonial advertisement, and a performance analysis on this case study is presented.
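As a rough illustration of the idea in the abstract, the following minimal Python sketch matches keywords and constants from a tiny ontology against an advertisement and stores the resulting record in a database. It is not the authors' implementation: the slot names, the keyword lists, the age pattern, the `ads` table, and the functions `extract_record` and `store` are all assumptions made for this example.

```python
import re
import sqlite3

# Hypothetical mini-ontology: each slot maps keywords found in an ad
# to a normalized value. Slots and keywords are illustrative assumptions.
ONTOLOGY = {
    "gender": {"bride": "female", "girl": "female", "groom": "male", "boy": "male"},
    "education": {"btech": "BTech", "mba": "MBA", "mbbs": "MBBS"},
    "occupation": {"engineer": "engineer", "doctor": "doctor", "teacher": "teacher"},
}

# A constant-style pattern (assumed): a two-digit number followed by a year marker.
AGE_PATTERN = re.compile(r"\b(\d{2})\s*(?:yrs?|years?)\b", re.IGNORECASE)


def extract_record(ad_text):
    """Fill one record by keyword matching; word order in the ad does not matter."""
    record = {slot: None for slot in ONTOLOGY}
    record["age"] = None
    tokens = re.findall(r"[a-z]+", ad_text.lower())
    for slot, keywords in ONTOLOGY.items():
        for token in tokens:
            if token in keywords:
                record[slot] = keywords[token]  # first matching keyword wins
                break
    age_match = AGE_PATTERN.search(ad_text)
    if age_match:
        record["age"] = int(age_match.group(1))
    return record


def store(record, conn):
    """Insert the structured record into a (hypothetical) ads table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ads "
        "(gender TEXT, education TEXT, occupation TEXT, age INTEGER)"
    )
    conn.execute(
        "INSERT INTO ads VALUES (:gender, :education, :occupation, :age)", record
    )
    conn.commit()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    ad = "SM4 boy 28 yrs MBA engineer Bangalore"
    store(extract_record(ad), connection)
    print(connection.execute("SELECT * FROM ads").fetchall())
```

Slots with no matching keyword stay empty (`None`), so an incomplete advertisement still yields a record with the same fixed set of fields.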
