Integrated Model for Morphological Analysis and Named Entity Recognition Based on Label Attention Networks in Korean

Hongjin Kim,Harksoo Kim

doi:10.3390/app10113740

Hongjin Kim, Harksoo Kim

Open Access

https://doi.org/10.3390/app10113740

Copy DOI

Journal: Applied Sciences	Publication Date: May 28, 2020
Citations: 3	License type: CC BY 4.0

Affiliation: Kangwon National University, Konkuk University

Abstract

In well-spaced Korean sentences, morphological analysis is the first step in natural language processing, in which a Korean sentence is segmented into a sequence of morphemes and the parts of speech of the segmented morphemes are determined. Named entity recognition is a natural language processing task carried out to obtain morpheme sequences with specific meanings, such as person, location, and organization names. Although morphological analysis and named entity recognition are closely associated with each other, they have been independently studied and have exhibited the inevitable error propagation problem. Hence, we propose an integrated model based on label attention networks that simultaneously performs morphological analysis and named entity recognition. The proposed model comprises two layers of neural network models that are closely associated with each other. The lower layer performs a morphological analysis, whereas the upper layer performs a named entity recognition. In our experiments using a public gold-labeled dataset, the proposed model outperformed previous state-of-the-art models used for morphological analysis and named entity recognition. Furthermore, the results indicated that the integrated architecture could alleviate the error propagation problem.

Highlights

IntroductionIn Korean, morphological analysis (MA) is generally performed in the order of morpheme segmentation and part-of-speech (POS)
A morpheme refers to the smallest meaningful word in a phrase
To obtain optimal label paths better than those obtained with conditional random fields (CRFs), a label attention network (LAN) was proposed, which captured the potential long-term label dependency by providing incrementally refined label distributions with hierarchical attention to each word

Summary

Introduction

In Korean, morphological analysis (MA) is generally performed in the order of morpheme segmentation and part-of-speech (POS). Many NER models generally use the results of morphological analysis as informative clues [1,2]. This pipeline architecture causes the well-known error propagation problem. MA models for agglutinative languages, such as Korean and Japanese, demonstrate worse performances than those of isolating languages, which significantly affect the performances of the corresponding NER models. Sci. 2020, 10, 3740 capitalization, detecting NEs without any morphological information such as morpheme boundaries and POS tags is difficult.

Correct Results

Previous Studies

Integrated Model for MA and NER

Morpheme

Datasets and Experimental Setups

Implementation

Experimental Results

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integrated Model for Morphological Analysis and Named Entity Recognition Based on Label Attention Networks in Korean

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

A Comparative Study of Dictionary-based and Machine Learning-based Named Entity Recognition in Pashto
Rafiullah Momand ... Shakirullah Waseeb
-
Rafiullah Momand, et. al.Rafiullah Momand ... Shakirullah Waseeb
18 Dec 2020
18 Dec 2020

Effective integration of morphological analysis and named entity recognition based on a recurrent neural network
Hyeon-Gu Lee ... Harksoo Kim
Pattern Recognition Letters | VOL. 112
Hyeon-Gu Lee, et. al.Hyeon-Gu Lee ... Harksoo Kim
13 Aug 2018
Pattern Recognition Letters | VOL. 112

Named Entity Recognition using Support Vector Machine: A Language Independent Approach
...
Zenodo (CERN European Organization for Nuclear Research) | VOL. -
, et. al. ...
23 Mar 2010
Zenodo (CERN European Organization for Nuclear Research) | VOL. -

Named entity recognition and its role in unstructured data analysis
Oleh R Staso ... Nazarii Ye Burak
Informatics. Culture. Technology | VOL. 1
Oleh R Staso, et. al.Oleh R Staso ... Nazarii Ye Burak
26 Sep 2024
Informatics. Culture. Technology | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrated Model for Morphological Analysis and Named Entity Recognition Based on Label Attention Networks in Korean

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences