Abstract

Combined with neural language models, distributed word representations achieve significant advantages in computational linguistics and text mining. Most existing models estimate distributed word vectors from large-scale corpora in an unsupervised fashion and therefore do not take rich linguistic knowledge into consideration. Linguistic knowledge can be represented as either link-based knowledge or preference-based knowledge, and we propose knowledge regularized word representation models (KRWR) to incorporate this prior knowledge into the learning of distributed word representations. Experimental results demonstrate that our estimated word representations achieve better performance on the task of semantic relatedness ranking. This indicates that our methods can efficiently encode both prior knowledge from knowledge bases and statistical knowledge from large-scale text corpora into a unified word representation model, which will benefit many tasks in text mining.
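The abstract does not spell out how the two kinds of knowledge are encoded. As a rough, purely illustrative sketch (the word pairs, triples, and scoring below are assumptions, not the paper's formulation), link-based knowledge can be thought of as pairs of words that a knowledge base links and whose vectors should lie close together, while preference-based knowledge can be thought of as triples stating that one word should be closer to a target than another:

```python
# Hypothetical illustration of the two knowledge types named in the abstract;
# the data, names, and scoring functions are assumptions for exposition.
import numpy as np

# Link-based knowledge: linked word pairs (e.g. synonyms) whose vectors
# should end up close to each other in the embedding space.
link_pairs = [("car", "automobile"), ("coast", "shore")]

# Preference-based knowledge: (target, preferred, other) triples stating
# that `preferred` should be more related to `target` than `other` is.
preference_triples = [("car", "automobile", "banana")]

# Toy embedding table (in practice these come from a trained model).
rng = np.random.default_rng(0)
vocab = {w: rng.normal(size=50)
         for w in {"car", "automobile", "coast", "shore", "banana"}}

def link_penalty(emb, pairs):
    """Sum of squared Euclidean distances between linked word vectors."""
    return sum(np.sum((emb[a] - emb[b]) ** 2) for a, b in pairs)

def preference_violations(emb, triples):
    """Count triples where the dispreferred word is closer to the target."""
    def dist(a, b):
        return np.linalg.norm(emb[a] - emb[b])
    return sum(dist(t, good) >= dist(t, bad) for t, good, bad in triples)

print(link_penalty(vocab, link_pairs))
print(preference_violations(vocab, preference_triples))
```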

Highlights

  • The performance of text mining is heavily dependent on word representation

  • JO-SPR denotes a Softmax Probability Regularizer trained by Joint Optimization, PO-SPR a Softmax Probability Regularizer trained by Post Optimization, and PO-ER a Euclidean Regularizer trained by Post Optimization (an illustrative sketch of the post-optimization variant follows this list)

  • We propose a unified framework to incorporate prior knowledge into distributed word representations
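The paper's exact objectives are not reproduced on this page, so the following sketch is an assumed formulation rather than the authors' method. It contrasts a Euclidean regularizer applied by post-optimization (PO-ER), which nudges already-trained vectors of linked words toward each other, with joint optimization (JO), where the same penalty would be added to the embedding model's training loss:

```python
# Assumed illustration of the Euclidean regularizer (ER); the update rule and
# hyperparameters are guesses for exposition, not the paper's actual method.
import numpy as np

def post_optimize_er(emb, link_pairs, alpha=0.1, steps=50):
    """Post-optimization (PO-ER): after embeddings are trained, take gradient
    steps on the squared Euclidean distance between linked words' vectors,
    pulling them together while leaving all other words untouched."""
    emb = {w: v.copy() for w, v in emb.items()}
    for _ in range(steps):
        for a, b in link_pairs:
            diff = emb[a] - emb[b]
            emb[a] -= alpha * diff  # gradient of ||e_a - e_b||^2 w.r.t. e_a (factor 2 folded into alpha)
            emb[b] += alpha * diff
    return emb

# In joint optimization (JO), the same penalty term
#     lambda * sum over linked pairs (a, b) of ||e_a - e_b||^2
# would instead be added to the word-representation model's training loss
# and minimized together with the corpus objective.

rng = np.random.default_rng(0)
emb = {w: rng.normal(size=50) for w in ("car", "automobile", "banana")}
tuned = post_optimize_er(emb, [("car", "automobile")])
print(np.linalg.norm(emb["car"] - emb["automobile"]),
      np.linalg.norm(tuned["car"] - tuned["automobile"]))
```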


Summary

Introduction

The most widely used methods of word representation are vector space models (VSM) [1], which represent word meanings with vectors, with each dimension corresponding to semantic or syntactic information about the word. VSMs support similarity measures computed as distances between vectors and are widely adopted in applications such as information retrieval, text classification and question answering. It has long been known that simple co-occurrence counts do not work well for such distributional semantic models (DSM); techniques such as reweighting, smoothing and dimension reduction have been proposed to enhance performance [2]. However, these optimization techniques require heavy manual tuning, and it is non-trivial to extend DSMs to higher-level representations of sentences or documents.
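As a concrete instance of the distance computation described above (a minimal sketch; the toy co-occurrence counts are invented for illustration), cosine similarity between count-based word vectors looks like this:

```python
# Minimal sketch of a vector space model similarity computation; the
# co-occurrence counts below are made up for illustration only.
import numpy as np

# Each target word is represented by its counts over a few context words.
contexts = ["engine", "road", "water", "sand"]
counts = {
    "car":   np.array([10.0, 8.0, 0.0, 1.0]),
    "truck": np.array([ 9.0, 7.0, 1.0, 0.0]),
    "beach": np.array([ 0.0, 1.0, 6.0, 9.0]),
}

def cosine(u, v):
    """Cosine similarity, the standard similarity measure used with VSMs."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

print(cosine(counts["car"], counts["truck"]))   # high: similar contexts
print(cosine(counts["car"], counts["beach"]))   # low: different contexts
```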

