Lexical Patterns Based on Maximal Frequent Secuences for Automatic Keyphrase Extraction

Yanet Hernández Casimiro,Marco Antonio Ramos Corchado,Yulia Ledeneva,René Arnulfo García Hernández

doi:10.13053/cys-25-1-3868

Abstract

This paper presents a method for the automatic keyphrase extraction task using lexical patterns. First, the patterns are obtained from a set of data and converted into regular expression search patterns, allowing to consider sequences of characters that define a phrase without depending on its syntactic or semantic characteristics and thus obtain a list of possible candidates. Besides, to select the best, only those that obtained a high weight will be considered, in the following four weights: Boolean (B), Precision (P), Recall (R), and F-Measure (F); which corresponds to the result obtained from each evaluated pattern, therefore a list is generating of the best 5,10 and 15 keyphrases for each document. The evaluation of the method was realized by length (L) and combination (C), where the combination takes the best candidates for each length (1 to 4). The method was tested in corpus of scientific articles using the SemEval-2010 data set for task 5.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Lexical Patterns Based on Maximal Frequent Secuences for Automatic Keyphrase Extraction

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas

Lead the way for us

Similar Papers

Evaluation of mass spectral library search algorithms implemented in commercial software.
Andrey Samokhin ... Igor Revelsky
Journal of Mass Spectrometry | VOL. 50
Andrey Samokhin, et. al.Andrey Samokhin ... Igor Revelsky
05 May 2015
Journal of Mass Spectrometry | VOL. 50

Band Energy Difference for Source Attribution in Audio Forensics
Da Luo ... Jiwu Huang
IEEE Transactions on Information Forensics and Security | VOL. 13
Da Luo, et. al.Da Luo ... Jiwu Huang
01 Sep 2018
IEEE Transactions on Information Forensics and Security | VOL. 13

Apposition in the grammar of English

-

19 Mar 1992
19 Mar 1992

Fast physical object identification based on unclonable features and soft fingerprinting
Taras Holotyak ... Oleksiy Koval
-
Taras Holotyak, et. al.Taras Holotyak ... Oleksiy Koval
01 May 2011
01 May 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Lexical Patterns Based on Maximal Frequent Secuences for Automatic Keyphrase Extraction

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas