Combinatorial Classification for Chunking Arabic Texts

Feriel Ben Fraj

doi:10.5121/ijaia.2012.3506

Abstract

Text parsing has received special attention since the first applications of natural language processing (NLP). It is particularly difficult for Arabic, whose specific features make it quite different from, and even more ambiguous than, other natural languages when processed. In this paper, we discuss a new approach for chunking Arabic texts based on a combinatorial classification process. It is a modular chunker that first identifies the chunk heads using a combinatorial binary classification, then recognizes the chunk types based on the parts of speech of the chunk heads already identified. For the experimentation, we use more than 2,300 words as training data. The evaluation of the chunker consists of two steps and gives results that we consider very satisfactory (average accuracy of 89.60% for the classification step and 80.46% for the full chunking process).
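
The two-stage design described in the abstract (binary identification of chunk heads, then typing each chunk from the part of speech of its head) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the tag names, the POS-to-chunk-type table, the three toy binary deciders, and the majority vote used to combine them. The abstract does not say which features or classifiers are used, nor how their decisions are combined.

```python
from typing import Callable, List, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag)
BinaryDecider = Callable[[List[Token], int], bool]

# Hypothetical mapping from the POS of a chunk head to a chunk type.
HEAD_POS_TO_CHUNK = {"NOUN": "NP", "VERB": "VP", "PREP": "PP", "ADJ": "AP"}


# Three toy binary deciders standing in for trained classifiers (assumption).
def has_head_like_pos(tokens: List[Token], i: int) -> bool:
    return tokens[i][1] in HEAD_POS_TO_CHUNK


def not_followed_by_same_pos(tokens: List[Token], i: int) -> bool:
    return i + 1 >= len(tokens) or tokens[i + 1][1] != tokens[i][1]


def is_not_determiner(tokens: List[Token], i: int) -> bool:
    return tokens[i][1] != "DET"


DECIDERS: List[BinaryDecider] = [
    has_head_like_pos,
    not_followed_by_same_pos,
    is_not_determiner,
]


def is_chunk_head(tokens: List[Token], i: int) -> bool:
    """Step 1: combine the binary decisions (majority vote, an assumption)."""
    votes = sum(1 for decide in DECIDERS if decide(tokens, i))
    return votes * 2 > len(DECIDERS)


def chunk_heads(tokens: List[Token]) -> List[Tuple[str, str]]:
    """Step 2: type each identified head from its part of speech."""
    result = []
    for i, (word, pos) in enumerate(tokens):
        if is_chunk_head(tokens, i):
            result.append((word, HEAD_POS_TO_CHUNK.get(pos, "UNKNOWN")))
    return result


# Transliterated toy input; the tags are assigned by hand for illustration.
sentence = [("hadha", "DET"), ("al-kitabu", "NOUN"), ("mufidun", "ADJ")]
print(chunk_heads(sentence))
```

Keeping the two steps in separate functions mirrors the modular structure the abstract describes and lets each step be evaluated on its own, as the reported two-step evaluation does.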

Journal: International Journal of Artificial Intelligence & Applications
Publication Date: Sep 30, 2012
Citations: 3
