An hybrid approach to improve part of speech tagging system

Soufiane Farrah,Mohammed Ouzzif,El Houssaine Ziyati,Hanane El Manssouri

doi:10.1109/isacv.2018.8354032

Abstract

Platforms interacting with data in text format, such as social networks or search engines, face major challenges regarding this flow of texts such as storage, search and information processing. New disciplines have emerged as natural language processing that involve identifying all aspects of language (spoken or written). In this perspective, we focus on the aspect of part-of speech (POS) tagging applied to the Arabic language which consists in marking each word in the text with its good tag. One of the most difficult problems affecting POS tagging is the ambiguity of the text. Ambiguity is the most important problem in the natural language processing. We propose a rule-based hybrid approach with an artificial neural network classifier to determine the appropriate tags of an Arabic text. The first phase consists of extracting all the affixes to identify the nature of the word and its tags according to grammatical rules, the second phase begins by transliterating the Arabic text into text with Roman letters. The transliterated text is then transformed into digital vectors to form the input of the classifier based on the neural networks. The two phases are combined to identify the tag of each word.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An hybrid approach to improve part of speech tagging system

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The Development of Indonesian POS Tagging System for Computer-aided Independent Language Learning
Muljono Muljono ... Umriya Afini
International Journal of Emerging Technologies in Learning (iJET) | VOL. 12
Muljono Muljono, et. al.Muljono Muljono ... Umriya Afini
16 Nov 2017
International Journal of Emerging Technologies in Learning (iJET) | VOL. 12

Hidden Markov Model based Part of Speech Tagging for Nepali language
Abhijit Paul ... Bipul Syam Purkayastha
-
Abhijit Paul, et. al.Abhijit Paul ... Bipul Syam Purkayastha
01 Sep 2015
01 Sep 2015

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura
Nindian Puspa Dewi ... Ubaidi Ubaidi
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7
Nindian Puspa Dewi, et. al.Nindian Puspa Dewi ... Ubaidi Ubaidi
01 Oct 2020
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7

Parts of Speech Tagging in Bengali for MWEs Detection
Bipul Syam Purkayastha ... Md Jaynalabedin
International Journal of Computer Applications | VOL. 99
Bipul Syam Purkayastha, et. al.Bipul Syam Purkayastha ... Md Jaynalabedin
20 Aug 2014
International Journal of Computer Applications | VOL. 99

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An hybrid approach to improve part of speech tagging system

Abstract

Talk to us

Similar Papers