Word Embedding에 PCA를 적용한 개체명 인식 모델을 위한 효율적인 학습방법 연구

Eun-Young Song,Hoe-Ryeon Choi,Hong-Chul Lee

doi:10.7232/jkiie.2019.45.1.030

Abstract

The Bidirectional LSTM CRF model used for Named Entity Recognition takes much time to train Named Entity. The hyper-parameters of Word Embedding used as input data in this model affect performance and training time. However, there is very little research on the number of dimensions, which is one of the parameters of Word Embedding. In this paper, we obtain proper number of 4-Word Embeddings (fastText, GloVe, skipgram, CBOW) considering performance and training time in Bidirectional LSTM CRF which can input large amount of data. Next, apply the PCA to the word vector in Word Embedding to reduce the dimension to small dimensional (10 dimensions) intervals. Therefore, applying PCA to conventional Word Embedding and training Word Embedding with small dimensional intervals shows that the model can be trained by maintaining or improving performance based on stable training time in fewer dimensions than conventional Word Embedding.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Word Embedding에 PCA를 적용한 개체명 인식 모델을 위한 효율적인 학습방법 연구

Abstract

Talk to us

Similar Papers

More From: Journal of the Korean Institute of Industrial Engineers

Lead the way for us

Journal: Journal of the Korean Institute of Industrial Engineers	Publication Date: Feb 28, 2019
Citations: 1

Similar Papers

A Character-Enhanced Chinese Word Embedding Model
Gang Yang ... Tianhao He
-
Gang Yang, et. al.Gang Yang ... Tianhao He
01 Jul 2019
01 Jul 2019

A comparison of word embeddings for the biomedical natural language processing.
Yanshan Wang ... Feichen Shen
Journal of Biomedical Informatics | VOL. 87
Yanshan Wang, et. al.Yanshan Wang ... Feichen Shen
12 Sep 2018
Journal of Biomedical Informatics | VOL. 87

Word Embeddings for Entity-Annotated Texts
Satya Almasian ... Andreas Spitz
-
Satya Almasian, et. al.Satya Almasian ... Andreas Spitz
01 Jan 2019
01 Jan 2019

Training word embeddings for deep learning in biomedical text mining tasks
Zhenchao Jiang ... Degen Huang
-
Zhenchao Jiang, et. al.Zhenchao Jiang ... Degen Huang
01 Nov 2015
01 Nov 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Word Embedding에 PCA를 적용한 개체명 인식 모델을 위한 효율적인 학습방법 연구

Abstract

Talk to us

Similar Papers

More From: Journal of the Korean Institute of Industrial Engineers