Extracting Arabic Composite Names Using Genitive Principles of Arabic Grammar

Hussein Khalil,Taha Osman,Mohammed Miltan

doi:10.1145/3382187

Abstract

Named Entity Recognition (NER) is a basic prerequisite of using Natural Language Processing (NLP) for information retrieval. Arabic NER is especially challenging as the language is morphologically rich and has short vowels with no capitalisation convention. This article presents a novel rule-based approach that uses linguistic grammar-based techniques to extract Arabic composite names from Arabic text. Our approach uniquely exploits the genitive Arabic grammar rules; in particular, the rules regarding the identification of definite nouns (معرفة) and indefinite nouns (نكرة) to support the process of extracting composite names. Based on domain knowledge and Arabic Genitive Rules (AGR), the developed approach formalises a set of syntactical rules and linguistic patterns that initially use genitive patterns to classify definiteness within phrases and then extracts proper composite names from the unstructured text. The developed novel approach does not place any constraints on the length of the Arabic composite name and our initial experimentation demonstrated high recall and precision results when the NER algorithm was applied to a financial domain corpus.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extracting Arabic Composite Names Using Genitive Principles of Arabic Grammar

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Jun 7, 2020
Citations: 5

Similar Papers

A Novel Hybrid Approach to Arabic Named Entity Recognition
Mohamed A Meselhi ... Khaled Shaalan
-
Mohamed A Meselhi, et. al.Mohamed A Meselhi ... Khaled Shaalan
01 Jan 2014
01 Jan 2014

A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
Computational Linguistics | VOL. 40
Khaled ShaalanKhaled Shaalan
01 Jun 2014
Computational Linguistics | VOL. 40

A Comparative Review of Machine Learning for Arabic Named Entity Recognition
Ramzi Esmail Salah ... Lailatul Qadri Binti Zakaria
International Journal on Advanced Science, Engineering and Information Technology | VOL. 7
Ramzi Esmail Salah, et. al.Ramzi Esmail Salah ... Lailatul Qadri Binti Zakaria
16 Apr 2017
International Journal on Advanced Science, Engineering and Information Technology | VOL. 7

Arabic Named Entity Recognition—A Survey and Analysis
Amal Dandashi ... Sebti Foufou
-
Amal Dandashi, et. al.Amal Dandashi ... Sebti Foufou
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extracting Arabic Composite Names Using Genitive Principles of Arabic Grammar

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing