Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification

Jie Hu,Liya Yu,Shaobo Li,Jianjun Hu,Yong Yao,Guanci Yang

doi:10.3390/e20020104

Jie Hu, Liya Yu + Show 4 more

Open Access

https://doi.org/10.3390/e20020104

Copy DOI

Journal: Entropy	Publication Date: Feb 2, 2018
Citations: 75	License type: CC BY 4.0

Affiliation: Guizhou University, University of South Carolina

Abstract

Many text mining tasks such as text retrieval, text summarization, and text comparisons depend on the extraction of representative keywords from the main text. Most existing keyword extraction algorithms are based on discrete bag-of-words type of word representation of the text. In this paper, we propose a patent keyword extraction algorithm (PKEA) based on the distributed Skip-gram model for patent classification. We also develop a set of quantitative performance measures for keyword extraction evaluation based on information gain and cross-validation, based on Support Vector Machine (SVM) classification, which are valuable when human-annotated keywords are not available. We used a standard benchmark dataset and a homemade patent dataset to evaluate the performance of PKEA. Our patent dataset includes 2500 patents from five distinct technological fields related to autonomous cars (GPS systems, lidar systems, object recognition systems, radar systems, and vehicle control systems). We compared our method with Frequency, Term Frequency-Inverse Document Frequency (TF-IDF), TextRank and Rapid Automatic Keyword Extraction (RAKE). The experimental results show that our proposed algorithm provides a promising way to extract keywords from patent texts for patent classification.

Highlights

Patents are an important part of intellectual property
The reliability and performance of subsequent analyses will be affected, which in turn makes it hard to draw reliable insights from analysis results. Considering these issues, this paper examines the effectiveness of deep learning-based keyword extraction methods and proposes a keyword extraction method based on the Skip-gram [20,21,22] model to effectively extract keywords from patent text for patent classification
We develop a method to extract representative keywords from patents, which are used as the features of the patent text for high performance classification by Support Vector Machine (SVM) classifiers

Summary

Introduction

Patents are an important part of intellectual property. Effective patent analysis may bring lots of benefits for the enterprise. Usually automated patent classifiers are applied to a huge number of patent applications, which are inspected by patent examiner to check the proof for the classification to make final classification decision. This is especially true for classification predictions that have low confidence by the classifiers. Due to this special requirement, high-performance patent classifiers that can explain their classification with extracted keywords, ready for quick inspection by the patent examiner, are strongly desirable

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy

Lead the way for us

Similar Papers

Text Keyword Extraction Based on Multi-dimensional Features
Yu Jin ... Lizhen Xu
-
Yu Jin, et. al.Yu Jin ... Lizhen Xu
01 Jan 2020
01 Jan 2020

A Graph based Approach for Keyword Extraction from Documents
S Anjali ... M.G Thushara
-
S Anjali, et. al.S Anjali ... M.G Thushara
01 Feb 2019
01 Feb 2019

Research on Keyword Extraction Algorithm in English Text Based on Cluster Analysis.
Jingxia Ma
Computational Intelligence and Neuroscience | VOL. 2022
Jingxia MaJingxia Ma
28 Mar 2022
Computational Intelligence and Neuroscience | VOL. 2022

A Content-Based Collaborative Filtering Movie Recommendation System using Keywords Extractions
Mtuthuko Mngomezulu ... Ritesh Ajoodha
-
Mtuthuko Mngomezulu, et. al.Mtuthuko Mngomezulu ... Ritesh Ajoodha
27 Oct 2022
27 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy