WEKE: Learning Word Embeddings for Keyphrase Extraction

Yuxiang Zhang,Huan Liu,Suge Wang,Bei Shi,Xiaoli Li

doi:10.1007/978-3-030-60290-1_19

Abstract

Traditional supervised keyphrase extraction models depend on the features of labeled keyphrases while prevailing unsupervised models mainly rely on global structure of the word graph, with nodes representing candidate words and edges/links capturing the co-occurrence between words. However, the local context information of the word graph can not be exploited in existing unsupervised graph-based keyphrase extraction methods and integrating different types of information into a unified model is relatively unexplored. In this paper, we propose a new word embedding model specially for keyphrase extraction task, which can capture local context information and incorporate them with other types of crucial information into the low-dimensional word vector to help better extract keyphrases. Experimental results show that our method consistently outperforms 7 state-of-the-art unsupervised methods on three real datasets in Computer Science area for keyphrase extraction.

Full Text