Part-of-Speech Tagging via Deep Neural Networks for Northern-Ethiopic Languages

Jurgita Kapočiūtė-Dzikienė, Senait Gebremichael Tesfagergish

doi:10.5755/j01.itc.49.4.26808

Open Access

Abstract

Deep Neural Networks (DNNs) have proven especially successful in Natural Language Processing (NLP) and in Part-Of-Speech (POS) tagging, which is the process of mapping words to their corresponding POS labels depending on the context. Despite recent advances in language technologies, low-resourced languages (such as the East African language Tigrinya) have received too little attention. We investigate the effectiveness of Deep Learning (DL) solutions for the low-resourced Tigrinya language of the Northern-Ethiopic branch. We selected Tigrinya as the testbed and tested state-of-the-art DL approaches, seeking to build the most accurate POS tagger. We evaluated DNN classifiers (Feed Forward Neural Network, FFNN; Long Short-Term Memory, LSTM; Bidirectional LSTM, BiLSTM; and Convolutional Neural Network, CNN) on top of neural word2vec word embeddings with a small training corpus known as the Nagaoka Tigrinya Corpus. To determine the best DNN classifier type, its architecture, and its hyper-parameter set, both manual and automatic hyper-parameter tuning were performed. The BiLSTM method proved to be the most suitable for this task: it achieved the highest accuracy, 92%, which is 65% above the random baseline.
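The comparison between tagger accuracy and a random baseline can be illustrated with a minimal sketch. It assumes one common definition of the random baseline, the expected accuracy of guessing tags according to their relative frequencies (the sum of squared tag proportions); the abstract does not state which definition the authors used. The tag names, the toy data, and the function names `token_accuracy` and `random_baseline` are hypothetical and are not taken from the Nagaoka Tigrinya Corpus.

```python
from collections import Counter


def token_accuracy(gold, predicted):
    """Fraction of tokens whose predicted tag equals the gold tag."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    correct = sum(g == p for g, p in zip(gold, predicted))
    return correct / len(gold)


def random_baseline(gold):
    """Expected accuracy of guessing tags in proportion to their frequency.

    Assumed definition: sum over tags of p(tag) squared.
    """
    counts = Counter(gold)
    total = len(gold)
    return sum((count / total) ** 2 for count in counts.values())


# Hypothetical toy data: ten tokens with gold and predicted tags.
gold = ["N", "V", "N", "ADJ", "N", "V", "N", "PRE", "N", "V"]
predicted = ["N", "V", "N", "N", "N", "V", "N", "PRE", "V", "V"]

accuracy = token_accuracy(gold, predicted)
baseline = random_baseline(gold)
print(f"accuracy={accuracy:.2f} baseline={baseline:.2f}")
```

On this toy data the tagger gets 8 of 10 tokens right, and the baseline is 0.5² + 0.3² + 0.1² + 0.1² = 0.36. A larger or more evenly distributed tagset lowers the baseline, which is why accuracy is usually reported alongside it.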

Summary

Introduction

POS tagging (also called grammatical tagging) is an important application of NLP and a core concept that many higher-level language technologies depend on. NLP applications such as machine translation, speech recognition, dependency parsing, and many more depend on POS tagging to be more accurate. Although manual POS tagging looks like a rather easy task from a human point of view, it is a challenging AI problem, mainly because of word disambiguation. Because languages differ in their nature and morphological complexity, there is no single smart solution that could solve all POS tagging problems for all languages of the world. The issue of differing annotation schemas is not tackled, whereas disambiguation issues can be resolved by training Machine Learning (ML) methods on sufficiently large manually POS-tagged corpora (so-called gold-standard corpora).
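Why disambiguation makes tagging hard can be seen in a small sketch of a context-free tagger that assigns each word its most frequent tag from a gold-standard corpus. This is not the method evaluated in the paper; the English toy corpus, the tag names, and the function names are hypothetical and serve only to show where a tagger that ignores context fails.

```python
from collections import Counter, defaultdict


def train_most_frequent_tag(tagged_sentences):
    """Build a word -> most frequent tag lexicon from (word, tag) pairs."""
    counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        for word, tag in sentence:
            counts[word][tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def tag_words(words, lexicon, default="N"):
    """Tag each word independently of its neighbours."""
    return [lexicon.get(word, default) for word in words]


# Hypothetical gold-standard corpus: "book" occurs once as a verb, twice as a noun.
corpus = [
    [("they", "PRON"), ("book", "V"), ("flights", "N")],
    [("the", "DET"), ("book", "N"), ("fell", "V")],
    [("a", "DET"), ("book", "N"), ("helps", "V")],
]

lexicon = train_most_frequent_tag(corpus)
print(tag_words(["they", "book", "flights"], lexicon))
```

The lookup tags "book" as a noun in "they book flights", although the gold tag in that sentence is a verb. Resolving such cases requires the surrounding words, which is what context-aware models trained on gold-standard corpora are meant to capture.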

Journal: Information Technology And Control
Publication Date: Dec 19, 2020
Citations: 8
License type: CC BY
