Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features

Saikiranmai Gorla, Lalita Bhanu Murthy Neti, Aruna Malapati

doi:10.3390/info11020082

Abstract

Named entity recognition (NER) is a fundamental step for many natural language processing tasks and hence enhancing the performance of NER models is always appreciated. With limited resources being available, NER for South-East Asian languages like Telugu is quite a challenging problem. This paper attempts to improve the NER performance for Telugu using gazetteer-related features, which are automatically generated using Wikipedia pages. We make use of these gazetteer features along with other well-known features like contextual, word-level, and corpus features to build NER models. NER models are developed using three well-known classifiers: conditional random field (CRF), support vector machine (SVM), and margin infused relaxed algorithm (MIRA). The gazetteer features are shown to improve the performance, and the MIRA-based NER model fared better than its counterparts SVM and CRF.

Highlights

Named entity recognition (NER) is a sub-task of information extraction (IE) to identify and classify textual elements into a pre-defined set of categories called named entities (NEs) such as the name of a person, organization, or location, expressions of time, quantities, monetary values, percentages, etc.
We put forth an approach to generate gazetteers dynamically for three named entities (person, location, and organization) and propose gazetteer-based features for Telugu NER.
We performed morphological pre-processing and used language-dependent features to enhance the performance of the NER models.
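
As an illustration of how gazetteer features can sit alongside contextual and word-level features, here is a minimal Python sketch. It is not the authors' implementation: the gazetteer entries, the feature names, and the helper functions `token_features` and `sentence_features` are all hypothetical, and the toy lists are hand-written and transliterated rather than generated from Wikipedia pages as in the paper.

```python
from typing import Dict, List, Set

# Hypothetical toy gazetteers (transliterated for readability).
# The paper generates its lists automatically from Wikipedia pages;
# these hand-written entries are only placeholders.
GAZETTEERS: Dict[str, Set[str]] = {
    "person": {"gandhi", "nehru"},
    "location": {"hyderabad", "vijayawada"},
    "organization": {"infosys"},
}


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Build a feature dictionary for the token at position i."""
    word = tokens[i]
    lower = word.lower()
    feats: Dict[str, object] = {
        # Word-level features
        "word": lower,
        "prefix2": lower[:2],
        "suffix2": lower[-2:],
        "is_digit": word.isdigit(),
        # Contextual features: neighbouring words, with boundary markers
        "prev_word": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next_word": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }
    # Gazetteer features: one binary membership flag per entity type
    for name, entries in GAZETTEERS.items():
        feats[f"in_{name}_gaz"] = lower in entries
    return feats


def sentence_features(tokens: List[str]) -> List[Dict[str, object]]:
    """Feature dictionaries for every token in a sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    for feats in sentence_features(["Gandhi", "visited", "Hyderabad"]):
        print(feats)
```

Per-token dictionaries of this shape are a common input format for sequence labellers such as CRF toolkits; which features the paper's CRF, SVM, and MIRA models actually consume is described in the full text, not here.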

Summary

Introduction

Named entity recognition (NER) is a sub-task of information extraction (IE) to identify and classify textual elements (words or sequences of words) into a pre-defined set of categories called named entities (NEs) such as the name of a person, organization, or location, expressions of time, quantities, monetary values, percentages, etc. NER plays an essential role in extracting knowledge from the digital information stored in a structured or unstructured form. It acts as a pre-processing tool for many applications. The research study by Babych and Hartley [4] showed that including a pre-processing step by tagging text with



Journal: Information
Publication Date: Feb 2, 2020
Citations: 3
License type: CC BY 4.0


