Abstract
We describe an SMS-based information system called CATS, which allows users to post and search free Arabic text using Information Extraction (IE) technology. We discuss the challenges of applying IE technology to unedited, real-world Arabic text. In addition, we describe the structure of the system and our approach to producing an open, robust system that can incorporate additional subdomains with minimal effort.
Highlights
Natural language is considered the simplest technique of human-machine interaction
Information Extraction (IE) systems are usually designed for a specific domain, and the types of facts to be extracted are defined in advance [11]
Although most researchers believe that IE technology is promising and pertinent to a wide range of fields, much of the research has been directed toward news items found on the web
Summary
Natural language is considered the simplest technique of human-machine interaction. It is suitable for naïve users who know the task domain well. Information Extraction (IE) is a comparatively new technology within the more general field of Natural Language Processing. Current developments in the field of IE can be traced through the Message Understanding Conferences (MUCs). In these competitions, English has always been the sole target language, with the exception of MUC-6 (MET-1), where Spanish and Chinese were considered as well [10]. We describe our efforts to employ IE technology in an SMS-based information system called CATS, which uses Arabic as the interaction language to connect sellers and buyers through SMS in the classified ads domain.
More From: International Journal of Interactive Mobile Technologies (iJIM)