An Improved Corpus-Based NLP Method for Facilitating Keyword Extraction: An Example of the COVID-19 Vaccine Hesitancy Corpus

Liang-Ching Chen

doi:10.3390/su15043402

Abstract

In the current COVID-19 post-pandemic era, COVID-19 vaccine hesitancy is hindering the herd immunity generated by widespread vaccination. It is critical to identify the factors that may cause COVID-19 vaccine hesitancy, enabling the relevant authorities to propose appropriate interventions for mitigating such a phenomenon. Keyword extraction, a sub-field of natural language processing (NLP) applications, plays a vital role in modern medical informatics. When traditional corpus-based NLP methods are used to conduct keyword extraction, they only consider a word’s log-likelihood value to determine whether it is a keyword, which leaves room for concerns about the efficiency and accuracy of this keyword extraction technique. These concerns include the fact that the method is unable to (1) optimize the keyword list by the machine-based approach, (2) effectively evaluate the keyword’s importance level, and (3) integrate the variables to conduct data clustering. Thus, to address the aforementioned issues, this study integrated a machine-based word removal technique, the i10-index, and the importance–performance analysis (IPA) technique to develop an improved corpus-based NLP method for facilitating keyword extraction. The top 200 most-cited Science Citation Index (SCI) research articles discussing COVID-19 vaccine hesitancy were adopted as the target corpus for verification. The results showed that the keywords of Quadrant I (n = 98) reached the highest lexical coverage (9.81%), indicating that the proposed method successfully identified and extracted the most important keywords from the target corpus, thus achieving more domain-oriented and accurate keyword extraction results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sustainability	Publication Date: Feb 13, 2023
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Improved Corpus-Based NLP Method for Facilitating Keyword Extraction: An Example of the COVID-19 Vaccine Hesitancy Corpus

Abstract

Talk to us

Similar Papers

More From: Sustainability

Lead the way for us

Similar Papers

An extended TF-IDF method for improving keyword extraction in traditional corpus-based research: An example of a climate change corpus
Liang-Ching Chen
Data & Knowledge Engineering | VOL. 153
Liang-Ching ChenLiang-Ching Chen
30 May 2024
Data & Knowledge Engineering | VOL. 153

Ensemble of keyword extraction methods and classifiers in text classification
Aytuğ Onan ... Hasan Bulut
Expert Systems with Applications | VOL. 57
Aytuğ Onan, et. al.Aytuğ Onan ... Hasan Bulut
29 Mar 2016
Expert Systems with Applications | VOL. 57

Algorithm for Bengali Keyword Extraction
Md Ruhul Amin ... Madhusodan Chakraborty
-
Md Ruhul Amin, et. al.Md Ruhul Amin ... Madhusodan Chakraborty
01 Sep 2018
01 Sep 2018

A Survey on Keyword and Key Phrase Extraction with Deep Learning
Ozlem Unlu ... Aydin Cetin
-
Ozlem Unlu, et. al.Ozlem Unlu ... Aydin Cetin
01 Oct 2019
01 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Improved Corpus-Based NLP Method for Facilitating Keyword Extraction: An Example of the COVID-19 Vaccine Hesitancy Corpus

Abstract

Talk to us

Similar Papers

More From: Sustainability