Looking into the Operational Modalities Adopted in Some of the POS Tagging Tools in Identification of Contextual Part-of-Speech of Words in Texts

Kesavan Vadakalur Elumalai,Anas Maktabi,Mufleh Salem M Alqahtani,Niladri Sekhar Das

doi:10.7575/aiac.ijalel.v.8n.6p.92

Abstract

Part-of-speech (POS) tagging is an indispensable method of text processing. The main aim is to assign part-of-speech to words after considering their actual contextual syntactic-cum-semantic roles in a piece of text where they occur (Siemund & Claridge 1997). This is a useful strategy in language processing, language technology, machine learning, machine translation, and computational linguistics as it generates a kind of output that enables a system to work with natural language texts with greater accuracy and success. Part-of-speech tagging is also known as ‘grammatical annotation’ and ‘word category disambiguation’ in some area of linguistics where analysis of form and function of words are important avenues for better comprehension and application of texts. Since the primary task of POS tagging involves a process of assigning a tag to each word, manually or automatically, in a piece of natural language text, it has to pay adequate attention to the contexts where words are used. This is a tough challenge for a system as it normally fails to know how word carries specific linguistic information in a text and what kind of larger syntactic frames it requires for its operation. The present paper takes up this issue into consideration and tries to critically explore how some of the well-known POS tagging systems are capable of handling this kind of challenge and if these POS tagging systems are at all successful in assigning appropriate POS tags to words without accessing information from extratextual domains. The novelty of the paper lies in its attempt for looking into some of the POS tagging schemes proposed so far to see if the systems are actually successful in dealing with the complexities involved in tagging words in texts. It also checks if the performance of these systems is better than manual POS tagging and verifies if information and insights gathered from such enterprises are at all useful for enhancing our understanding about identity and function of words used in texts. All these are addressed in this paper with reference to some of the POS taggers available to us. Moreover, the paper tries to see how a POS tagged text is useful in various applications thereby creating a sense of awareness about multifunctionality of tagged texts among language users.

Highlights

An electronically developed corpus, after it is annotated at the part-of-speech level, becomes useful for various works of language analysis, processing, application and reference in language technology, applied linguistics, translation, dictionary compilation, language teaching and description (Sinclair 2004)
The main aim is to assign part-of-speech to words after considering their actual contextual syntactic-cumsemantic roles in a piece of text where they occur (Siemund & Claridge 1997). This is a useful strategy in language processing, language technology, machine learning, machine translation, and computational linguistics as it generates a kind of output that enables a system to work with natural language texts with greater accuracy and success
In descriptive and applied linguistics, for instance, POS tagging of words is necessary because we find that words are able to represent different parts-of-speech in different contexts

Summary

Introduction

An electronically developed corpus (i.e., digital language database), after it is annotated at the part-of-speech level, becomes useful for various works of language analysis, processing, application and reference in language technology, applied linguistics, translation, dictionary compilation, language teaching and description (Sinclair 2004). Since the primary task of POS tagging involves a process of assigning a tag to each word, manually or automatically, in a piece of natural language text, it has to pay adequate attention to the contexts where words are used.

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Looking into the Operational Modalities Adopted in Some of the POS Tagging Tools in Identification of Contextual Part-of-Speech of Words in Texts

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Applied Linguistics and English Literature

Lead the way for us

Journal: International Journal of Applied Linguistics and English Literature	Publication Date: Nov 30, 2019
License type: CC BY 4.0

Similar Papers

Implementation of Kadazan Tagger Based on Brill's Method
Marylyn Alex ... Lailatul Qadri Zakaria
Journal of ICT Research and Applications | VOL. 7
Marylyn Alex, et. al.Marylyn Alex ... Lailatul Qadri Zakaria
01 Dec 2013
Journal of ICT Research and Applications | VOL. 7

Building Machine Learning System with Deep Neural Network for Text Processing
... Anshika Rastogi
-
, et. al. ... Anshika Rastogi
17 Aug 2017
17 Aug 2017

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura
Nindian Puspa Dewi ... Ubaidi Ubaidi
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7
Nindian Puspa Dewi, et. al.Nindian Puspa Dewi ... Ubaidi Ubaidi
01 Oct 2020
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7

Part of speech tagging for Arabic
Sandra Kübler ... Emad Mohamed
Natural Language Engineering | VOL. 18
Sandra Kübler, et. al.Sandra Kübler ... Emad Mohamed
06 Dec 2011
Natural Language Engineering | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Looking into the Operational Modalities Adopted in Some of the POS Tagging Tools in Identification of Contextual Part-of-Speech of Words in Texts

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Applied Linguistics and English Literature