Simple methods to overcome the limitations of general word representations in natural language processing tasks

Hongyeon Yu,Jaehyun An,Jeongmin Yoon,Hyemin Kim,Youngjoong Ko

doi:10.1016/j.csl.2019.04.009

Abstract

Although general word representations (GWRs) by skip-gram or GloVe have been widely used in many natural language processing (NLP) tasks with considerable success, they require further improvement. First, a GWR only represents general information of a word, even though task-oriented information can be more useful in specific tasks. Second, a GWR cannot avoid the out-of-vocabulary (OOV) problem. Thus, some recent studies have proposed methods based on an additional complex model or deep knowledge of resources for each specific task. Although such methods have the potential for improved performance, we believe that the baseline systems of each NLP task are already expensive; hence, making them more complex would be problematic for real-world applications. Therefore, the objective of this study is to overcome the limitations of GWRs by developing simple but effective methods for task-specific word representations (TSWRs) and OOV representations (OOVRs). The proposed methods achieved state-of-the-art performance in four Korean NLP tasks, namely part-of-speech tagging, named entity recognition, dependency parsing, and semantic role labeling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Simple methods to overcome the limitations of general word representations in natural language processing tasks

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Jun 15, 2019
Citations: 8

Similar Papers

Introduction
Cícero Nogueira Dos Santos ... Ruy Luiz Milidiú
-
Cícero Nogueira Dos Santos, et. al.Cícero Nogueira Dos Santos ... Ruy Luiz Milidiú
01 Jan 2012
01 Jan 2012

Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Wrick Talukdar ... Anjanava Biswas
International Journal of Innovative Science and Research Technology (IJISRT) | VOL. -
Wrick Talukdar, et. al.Wrick Talukdar ... Anjanava Biswas
03 Jun 2024
International Journal of Innovative Science and Research Technology (IJISRT) | VOL. -

Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.
Siun Kim ... Yesol Hong
Drug Safety | VOL. 46
Siun Kim, et. al.Siun Kim ... Yesol Hong
17 Jun 2023
Drug Safety | VOL. 46

Chinese Nominal Entity Recognition with Semantic Role Labeling
Wenbo Pang ... Xiaozhong Fan
-
Wenbo Pang, et. al.Wenbo Pang ... Xiaozhong Fan
01 Dec 2009
01 Dec 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Simple methods to overcome the limitations of general word representations in natural language processing tasks

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language