Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

Muhammad Kamran Malik

doi:10.1145/3129290

Abstract

Named Entity Recognition and Classification (NERC) is a process of identifying words and classifying them into person names, location names, organization names, and so on. In this article, we discuss the development of an Urdu Named Entity (NE) corpus, called the Kamran-PU-NE (KPU-NE) corpus, for three entity types, that is, Person, Organization, and Location, and marking the remaining tokens as Others (O). We use two supervised learning algorithms, Hidden Markov Model (HMM) and Artificial Neural Network (ANN), for the development of the Urdu NERC system. We annotate the 652852-token corpus taken from 15 different genres with a total of 44480 NEs. The inter-annotator agreement between the two annotators in terms of Kappa k statistic is 73.41%. With HMM, the highest recorded precision, recall, and f-measure values are 55.98%, 83.11%, and 66.90%, respectively, and with ANN, they are 81.05%, 87.54%, and 84.17%, respectively.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Sep 15, 2017
Citations: 24

Similar Papers

Word Embedding 자질을 이용한 한국어 개체명 인식 및 분류
Yunsu Choi ... Jeongwon Cha
Journal of KIISE | VOL. 43
Yunsu Choi, et. al.Yunsu Choi ... Jeongwon Cha
15 Jun 2016
Journal of KIISE | VOL. 43

Using Wikipedia for Cross-Language Named Entity Recognition
Eraldo R Fernandes ... Jordi Atserias
-
Eraldo R Fernandes, et. al.Eraldo R Fernandes ... Jordi Atserias
01 Jan 2015
01 Jan 2015

Named Entity Recognition System for Postpositional Languages: Urdu as a Case Study
Muhammad Kamran ... Syed Mansoor
International Journal of Advanced Computer Science and Applications | VOL. 7
Muhammad Kamran, et. al.Muhammad Kamran ... Syed Mansoor
01 Jan 2015
International Journal of Advanced Computer Science and Applications | VOL. 7

Kannada Named Entity Recognition and Classification using Support Vector Machine
...
Transactions on Machine Learning and Artificial Intelligence | VOL. 5
, et. al. ...
11 Mar 2017
Transactions on Machine Learning and Artificial Intelligence | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing