Abstract

We describe a SMS-based information system called CATS, which allows posting and searching through free Arabic text using Information Extraction (IE) technology. We discuss the challenges of applying IE technology for unedited real Arabic text. In addition, we describe the structure of this system and our approach to produce an open robust system capable of including more sub domains with the minimum effort.

Highlights

  • Natural language is considered the simplest technique of human-machine interaction

  • Information Extraction (IE) systems are usually designed for a specific domain, and the types of facts to be extracted are defined in advance [11]

  • Most of the researchers believe that the IE technology is promising and pertinent to a wide range of fields, much of the research have been directed toward news items found in the web

Read more

Summary

Introduction

Natural language is considered the simplest technique of human-machine interaction. It is suitable for naïve users who know the task domain well. Information Extraction (IE) is a comparatively new technology within the more general field of Natural Language Processing. The current development in the field of IE can be followed in to the Message Understanding Conferences (MUCs) In this competition English has always been the unique target language, with the exception of MUC-6 (MET-1), where Spanish and Chinese were considered as well [10]. We will describe our efforts for employing IE technology in a SMS based information system called CATS, which uses Arabic as an interaction language for connecting sellers and buyers through SMS in the classified ads domain

Background
Information Extraction and Arabic
Overview
Template Design
The Dictionary
Natural Language Processing for the Classified Ads
Knowledge Component
Evaluating the CATS System
Future Work
Conclusion
Findings
Authors
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.