Retracted] A New Rule‐Based Approach for Classical Arabic in Natural Language Processing

Ramzi Salah,Lailatul Qadri Binti Zakaria,Muaadh Mukred,Rashad Ahmed,Hasan Sari

doi:10.1155/2022/7164254

Abstract

Named entity recognition (NER) is fundamental in several natural language processing applications. It involves finding and categorizing text into predefined categories such as a person's name, location, and so on. One of the most famous approaches to identify named entity is the rule‐based approach. This paper introduces a rule‐based NER method that can be used to examine Classical Arabic documents. The proposed method relied on triggers words, patterns, gazetteers, rules, and blacklists generated by the linguistic information about entities named in Arabic. The method operates in three stages, operational stage, preprocessing stage, and processing the rule application stage. The proposed approach was evaluated, and the results indicate that this approach achieved a 90.2% rate of precision, an 89.3% level of recall, and an F‐measure of 89.5%. This new approach was introduced to overcome the challenges related to coverage in rule‐based NER systems, especially when dealing with Classical Arabic texts. It improved their performance and allowed for automated rule updates. The grammar rules, gazetteers, blacklist, patterns, and trigger words were all integrated into the rule‐based system in this way.

Highlights

Named entity recognition is a crucial step in numerous natural language processing (NLP) applications such as machine translation, question answering, and information retrieval, to name a few [1, 2]
We introduce a rule-based Named entity recognition (NER) method that can be used to examine Classical Arabic documents. e proposed method relied on triggers words, patterns, gazetteers, rules, Journal of Mathematics and blacklists generated by the linguistic information pertaining to entities named in Arabic
En, the operational contents were discussed with the preprocessing and processing stages. e new approach proposed by this study used trigger words, gazetteers, regular expressions, grammatical rules, and blacklists, and the methodology was explained

Summary

Introduction

Named entity recognition is a crucial step in numerous natural language processing (NLP) applications such as machine translation, question answering, and information retrieval, to name a few [1, 2]. Arabic is a morphologically complex language due to its inflectional nature; it has a general form of a word: prefix(es) + stem + suffix(es), with the number of prefixes and suffixes ranging from 0 to many. Another issue is that, depending on its position in the world, an Arabic letter can take up to three different forms [9, 10]. E proposed method relied on triggers words, patterns, gazetteers, rules, Journal of Mathematics and blacklists generated by the linguistic information pertaining to entities named in Arabic.

Related Work

Linguistic Resources

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Mathematics	Publication Date: Jan 1, 2022
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Retracted] A New Rule‐Based Approach for Classical Arabic in Natural Language Processing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Mathematics

Lead the way for us

Similar Papers

A Comparative Study of Dictionary-based and Machine Learning-based Named Entity Recognition in Pashto
Rafiullah Momand ... Ahmad Masood Latif Rai
-
Rafiullah Momand, et. al.Rafiullah Momand ... Ahmad Masood Latif Rai
18 Dec 2020
18 Dec 2020

Named Entity Recognition using Support Vector Machine: A Language Independent Approach
...
Zenodo (CERN European Organization for Nuclear Research) | VOL. -
, et. al. ...
23 Mar 2010
Zenodo (CERN European Organization for Nuclear Research) | VOL. -

A Concise Review of Named Entity Recognition System: Methods and Features
M Ikhwan Syafiq ... M Shukor Talib
IOP Conference Series: Materials Science and Engineering | VOL. 551
M Ikhwan Syafiq, et. al.M Ikhwan Syafiq ... M Shukor Talib
01 Aug 2019
IOP Conference Series: Materials Science and Engineering | VOL. 551

Named Entity Recognition Using Acyclic Weighted Digraphs: A Semi-supervised Statistical Method
Kono Kim ... Harksoo Kim
-
Kono Kim, et. al.Kono Kim ... Harksoo Kim
22 May 2007
22 May 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Retracted] A New Rule‐Based Approach for Classical Arabic in Natural Language Processing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Mathematics