Adapted TextRank for Term Extraction: A Generic Method of Improving Automatic Term Extraction Algorithms

Ziqi Zhang, Johann Petrak, Diana Maynard

doi:10.1016/j.procs.2018.09.010

Open Access

https://doi.org/10.1016/j.procs.2018.09.010

Journal: Procedia Computer Science
Publication Date: Jan 1, 2018
Citations: 24
License type: cc-by-nc-nd

Affiliation: University of Sheffield

Abstract

Automatic Term Extraction (ATE) is a fundamental Natural Language Processing (NLP) task used in many knowledge acquisition processes. It is challenging because it is highly domain dependent: no existing method consistently outperforms the others in all domains, and good ATE remains very much an unsolved problem. We propose a generic method for improving the ranking of terms extracted by a potentially wide range of existing ATE methods. We re-design the well-known TextRank algorithm to work at corpus level, using easily obtainable domain resources in the form of seed words or phrases, to compute a score for each word in the target dataset. This word score is used to refine a candidate term's score computed by an existing ATE method, potentially improving the ranking of real terms to be selected for tasks such as ontology engineering. Evaluation shows consistent improvement on 10 state-of-the-art ATE methods, by up to 25 percentage points in average precision measured at the top-ranked K candidates.
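The abstract does not give the graph construction, the scoring formula, or the way word scores are combined with the base ATE score, so the following is only a rough sketch of the general idea, not the authors' method. It assumes a word co-occurrence graph built with a sliding window, a personalized PageRank whose random jump lands only on seed words, and a multiplicative boost for re-ranking. All function names and parameters (`build_cooccurrence_graph`, `seeded_textrank`, `rerank`, `window`, `damping`) are hypothetical.

```python
from collections import defaultdict


def build_cooccurrence_graph(sentences, window=2):
    """Undirected word graph over the whole corpus (assumption: two words
    are linked if they co-occur within `window` tokens in any sentence)."""
    graph = defaultdict(set)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            for j in range(i + 1, min(i + window + 1, len(tokens))):
                if tokens[j] != word:
                    graph[word].add(tokens[j])
                    graph[tokens[j]].add(word)
    return graph


def seeded_textrank(graph, seeds, damping=0.85, iterations=50):
    """Personalized PageRank by power iteration: the random jump goes only
    to seed words, so words close to the seeds accumulate higher scores."""
    nodes = sorted(graph)
    seed_nodes = [n for n in nodes if n in seeds] or nodes  # uniform fallback
    jump = {n: (1.0 / len(seed_nodes) if n in seed_nodes else 0.0) for n in nodes}
    score = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        score = {
            n: (1 - damping) * jump[n]
            + damping * sum(score[m] / len(graph[m]) for m in graph[n])
            for n in nodes
        }
    return score


def rerank(base_scores, word_scores):
    """Refine each candidate's base ATE score (assumption: multiply it by
    1 + the mean normalized score of the candidate's words)."""
    top = max(word_scores.values(), default=0.0) or 1.0
    revised = {}
    for term, base in base_scores.items():
        words = term.split()
        boost = sum(word_scores.get(w, 0.0) for w in words) / (top * len(words))
        revised[term] = base * (1.0 + boost)
    return sorted(revised, key=revised.get, reverse=True)


corpus = [
    ["gene", "expression", "analysis"],
    ["protein", "expression", "level"],
    ["weather", "report", "today"],
]
word_scores = seeded_textrank(build_cooccurrence_graph(corpus), seeds={"gene"})
ranking = rerank({"weather report": 1.0, "gene expression": 1.0}, word_scores)
```

In this toy corpus the two candidates start with equal base scores, and the candidate whose words are connected to the seed word is ranked first after refinement. The base scores here stand in for the output of any existing ATE method.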

Similar Papers

SemRe-Rank
Ziqi Zhang ... Jie Gao
ACM Transactions on Knowledge Discovery from Data | VOL. 12
27 Jun 2018

A survey of automatic term extraction for Brazilian Portuguese
Merley Da Silva Conrado ... Thiago Alexandre Salgueiro Pardo
Journal of the Brazilian Computer Society | VOL. 20
30 May 2014

Automatic Arabic term extraction from special domain corpora
Abdul Mohsen Al-Thubaity ... Badriyya Alonazi
01 Oct 2014

Validação de termos de domínio por meio de uma base lexical-semântica difusa
Tradterm | VOL. 30
20 Dec 2017
