Impact analysis of keyword extraction using contextual word embedding.

Muhammad Qasim Khan, Muhammad Roman, M Irfan Uddin, Saeed M Alshahrani, Abdullah Alharbi, Jameel Almalki, Wael Alosaimi, Abdul Shahid

doi:10.7717/peerj-cs.967


Open Access

https://doi.org/10.7717/peerj-cs.967


Abstract


A document’s keywords provide high-level descriptions of its content, summarizing the document’s central themes, concepts, ideas, or arguments. These descriptive phrases make it easier for algorithms to find relevant information quickly and efficiently. Keyword extraction therefore plays a vital role in document processing tasks such as indexing, classification, clustering, and summarization. Traditional keyword extraction approaches rely, for the most part, on the statistical distribution of key terms in a document. Recent advances indicate that contextual information is critical in determining the semantics of a text. Context-based features may likewise be beneficial for keyword extraction. For example, the context of a phrase of interest can be described simply by the word that precedes or follows it. This research presents several experiments to validate that context-based keyword extraction yields a significant improvement over traditional methods. Additionally, the proposed KeyBERT-based methodology yields improved results. The proposed work identifies a group of important words or phrases in the document’s content that reflect the authors’ main ideas, concepts, or arguments, and it uses contextual word embeddings to extract them. Finally, the findings are compared with those obtained using older approaches such as TextRank, RAKE, Gensim, YAKE, and TF-IDF. The Journal of Universal Computer Science (JUCS) dataset was used in the research. Keywords for each research article were produced from its abstract only, and the KeyBERT model outperformed the traditional approaches in producing keywords similar to those provided by the authors. The average similarity between the keywords produced by our approach and the author-assigned keywords is 51%.
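
The abstract's simplest notion of context, the word immediately before or after a phrase of interest, can be illustrated with a minimal sketch. This is not the authors' pipeline and does not use KeyBERT or any embedding model; the function name `neighbor_context`, the regex tokenizer, and the sample text are assumptions introduced purely for illustration.

```python
import re

def neighbor_context(text, phrase):
    """Return a (previous_word, next_word) pair for each occurrence of `phrase`.

    Hypothetical helper: tokenization is a naive lowercase alphanumeric split,
    so sentence boundaries and punctuation are ignored.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    target = re.findall(r"[a-z0-9]+", phrase.lower())
    n = len(target)
    contexts = []
    if n == 0:
        return contexts
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == target:
            # None marks a phrase at the very start or end of the text.
            prev_word = tokens[i - 1] if i > 0 else None
            next_word = tokens[i + n] if i + n < len(tokens) else None
            contexts.append((prev_word, next_word))
    return contexts

# Made-up sample text, not taken from the JUCS dataset.
sample = ("Contextual word embedding improves keyword extraction. "
          "Keyword extraction supports indexing.")
print(neighbor_context(sample, "keyword extraction"))
# -> [('improves', 'keyword'), ('extraction', 'supports')]
```

Because the tokenizer discards punctuation, the neighbors in this sketch can cross sentence boundaries, as the sample output shows. A contextual embedding model would instead encode the whole surrounding passage rather than a single neighboring word.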


Journal: PeerJ Computer Science
Publication Date: May 30, 2022
Citations: 26
License type: CC BY 4.0


Similar Papers

Keyword Extraction Using Support Vector Machine
Kuo Zhang ... Hui Xu
01 Jan 2006

The Pragmatic Conception of Truth: A Theory of Communication in Analytic Philosophy (original title: Прагматическая концепция истины – теория коммуникации в аналитической философии)
Ekaterina Grigorenko
Ideas and Ideals | VOL. 13
15 Jun 2021

Learning as a Factor in Values Education: Insights into Axiological Meaning (original title: Mokymasis – vertybių ugdymo veiksnys: aksiologinės prasmės įžvalgos)
Romanas Vasiliauskas
Acta Paedagogica Vilnensia | VOL. 27
01 Jan 2010

A Keyword Extraction Method Based on Learning to Rank
Xianggao Cai ... Shujin Cao
01 Aug 2017
