Keyphrase Extraction Research Articles

Many Information Retrieval (IR) approaches have been proposed to extract relevant information from a large corpus. Among these methods, phrase-based retrieval methods have been proven to capture more concrete and concise information than word-based and paragraph-based methods. However, due to the complex relationship among phrases and a lack of proper visual guidance, achieving user-driven interactive information-seeking and retrieval remains challenging. In this study, we present a visual analytic approach for users to seek information from an extensive collection of documents efficiently. The main component of our approach is a PhraseMap, where nodes and edges represent the extracted keyphrases and their relationships, respectively, from a large corpus. To build the PhraseMap, we extract keyphrases from each document and link the phrases according to word attention determined using modern language models, i.e., BERT. As can be imagined, the graph is complex due to the extensive volume of information and the massive amount of relationships. Therefore, we develop a navigation algorithm to facilitate information seeking. It includes (1) a question-answering (QA) model to identify phrases related to users' queries and (2) updating relevant phrases based on users' feedback. To better present the PhraseMap, we introduce a resource-controlled self-organizing map (RC-SOM) to evenly and regularly display phrases on grid cells while expecting phrases with similar semantics to stay close in the visualization. To evaluate our approach, we conducted case studies with three domain experts in diverse literature. The results and feedback demonstrate its effectiveness, usability, and intelligence.

Read full abstract

The exponential growth of textual data poses a monumental challenge for extracting meaningful knowledge. Manually identifying descriptive keywords or keyphrases for each document is infeasible given the massive daily generated text. Automatic keyphrase extraction is, therefore, essential. However, current techniques struggle with learning the most salient semantic features from lengthy documents. This hybrid keyphrase extraction framework uniquely combines the complementary strengths of graph-based and textual feature methods. Our approach demonstrates improved performance over relying solely on statistical or graphical. Graph-based systems leverage word co- occurrence networks to score importance. Textual methods extract keyphrases using linguistic properties. Together, these complementary techniques overcome the limitations of relying on any strategy. The hybrid approach is evaluated on standard SemEval 2017 Task 10 and SemEval 2010 Task 5 benchmark datasets for scientific paper keyphrase extraction. Performance is quantified using the F1 score relative to human-annotated ground truth keyphrase. Results will quantify effectiveness on long documents with thousands of terms where only a few keywords represent salient concepts. Results show our technique effectively identifies the most salient semantic keywords, overcoming limitations of current techniques that struggle to mix features of graphical or statistical methods. Our experiments demonstrate that the proposed hybrid approach achieves superior F1 scores compared to current state-of-the-art methods on benchmark datasets. These results validate that synergistically combining graph and textual features enables more accurate keyphrase extraction, especially for long documents laden with extraneous terms.

Read full abstract

Keyphrase Extraction Research Articles

Related Topics

Articles published on Keyphrase Extraction

Stop-Word Lists in Keyphrase Extraction: Their Influence and Comparison

Unsupervised Keyphrase Extraction: Ranking Step and Single-Word Phrase Problem

HCUKE: A Hierarchical Context-aware approach for Unsupervised Keyphrase Extraction

Comparative Analysis on Automatic Keyphrase Extraction (AKPE) Techniques

A method of identifying domain-specific academic user information needs based on academic Q&A communities

Extracting Key-phrase Embedding using Deep Average Network and Maximal Marginal Relevance to Enhance Information Retrieval

Enhancing unsupervised keyphrase extraction through the integration of structural details in embedding-based approaches

Keyphrase Extraction from Scientific Articles

Comparative Evaluation of Keyphrase Extraction Tools for Semantic Analysis of Climate Change Scientific Reports and Ontology Enrichment

Keyphrase extraction using graph-based statistical approach with NLP patterns

A Systematic Review of Research on Gender Diversity in STEM Education

Developing a hierarchical model for unraveling conspiracy theories

AdaptiveUKE: Towards adaptive unsupervised keyphrase extraction with gated topic modeling

A Contrastive Learning Framework for Keyphrase Extraction

MICRank: Multi-information interconstrained keyphrase extraction

Y-Rank: A Multi-Feature-Based Keyphrase Extraction Method for Short Text

Keyphrase Extraction Using TextRank for Indonesian Text

PhraseMap: Attention-Based Keyphrases Recommendation for Information Seeking.

Hybrid Approach To Unsupervised Keyphrase Extraction

Key phrase extraction from patient’s chief complaints

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Keyphrase Extraction Research Articles

Related Topics

Articles published on Keyphrase Extraction

Stop-Word Lists in Keyphrase Extraction: Their Influence and Comparison

Unsupervised Keyphrase Extraction: Ranking Step and Single-Word Phrase Problem

HCUKE: A Hierarchical Context-aware approach for Unsupervised Keyphrase Extraction

Comparative Analysis on Automatic Keyphrase Extraction (AKPE) Techniques

A method of identifying domain-specific academic user information needs based on academic Q&A communities

Extracting Key-phrase Embedding using Deep Average Network and Maximal Marginal Relevance to Enhance Information Retrieval

Enhancing unsupervised keyphrase extraction through the integration of structural details in embedding-based approaches

Keyphrase Extraction from Scientific Articles

Comparative Evaluation of Keyphrase Extraction Tools for Semantic Analysis of Climate Change Scientific Reports and Ontology Enrichment

Keyphrase extraction using graph-based statistical approach with NLP patterns

A Systematic Review of Research on Gender Diversity in STEM Education

Developing a hierarchical model for unraveling conspiracy theories

AdaptiveUKE: Towards adaptive unsupervised keyphrase extraction with gated topic modeling

A Contrastive Learning Framework for Keyphrase Extraction

MICRank: Multi-information interconstrained keyphrase extraction

Y-Rank: A Multi-Feature-Based Keyphrase Extraction Method for Short Text

Keyphrase Extraction Using TextRank for Indonesian Text

PhraseMap: Attention-Based Keyphrases Recommendation for Information Seeking.

Hybrid Approach To Unsupervised Keyphrase Extraction

Key phrase extraction from patient’s chief complaints