Building neural network language model with POS-based negative sampling and stochastic conjugate gradient descent

Jin Liu,Jin Wang,Geumran Youn,Jeong-Uk Kim,Minghao Gu,Haoliang Ren,Li Lin

doi:10.1007/s00500-018-3181-2

Abstract

Traditional statistical language model is a probability distribution over sequences of words. It has the problem of curse of dimensionality incurred by the exponentially increasing number of possible sequences of words in training text. To solve this issue, neural network language models are proposed by representing words in a distributed way. Due to computation cost on updating a large number of word vectors’ gradients, neural network model needs much training time to converge. To alleviate this problem, in this paper, we propose a gradient descent algorithm based on stochastic conjugate gradient to accelerate the convergence of the neural network’s parameters. To improve the performance of the neural language model, we also propose a negative sampling algorithm based on POS (part of speech) tagging, which can optimize the negative sampling process and improve the quality of the final language model. A novel evaluation model is also used with perplexity to demonstrate the performance of the improved language model. Experiment results prove the effectiveness of our novel methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Building neural network language model with POS-based negative sampling and stochastic conjugate gradient descent

Abstract

Talk to us

Similar Papers

More From: Soft Computing

Lead the way for us

Journal: Soft Computing	Publication Date: Apr 24, 2018
Citations: 12

Similar Papers

Paraphrastic language models and combination with neural network language models
X Liu ... P C Woodland
-
X Liu, et. al.X Liu ... P C Woodland
01 May 2013
01 May 2013

Paraphrastic neural network language models
X Liu ... M J F Gales
-
X Liu, et. al.X Liu ... M J F Gales
01 May 2014
01 May 2014

Comparison of Various Neural Network Language Models in Speech Recognition
Lingyun Zuo ... Xin Wan
-
Lingyun Zuo, et. al.Lingyun Zuo ... Xin Wan
01 Jul 2016
01 Jul 2016

Language Model Score Regularization for Speech Recognition
Yike Zhang ... Pengyuan Zhang
Chinese Journal of Electronics | VOL. 28
Yike Zhang, et. al.Yike Zhang ... Pengyuan Zhang
01 May 2019
Chinese Journal of Electronics | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Building neural network language model with POS-based negative sampling and stochastic conjugate gradient descent

Abstract

Talk to us

Similar Papers

More From: Soft Computing