Measuring mono-word termhood by rank difference via corpus comparison

Chunyu Kit,Xiaoyue Liu

doi:10.1075/term.14.2.05kit

Abstract

Terminology as a set of concept carriers crystallizes our special knowledge about a subject. Automatic term recognition (ATR) plays a critical role in the processing and management of various kinds of information, knowledge and documents, e.g., knowledge acquisition via text mining. Measuring termhood properly is one of the core issues involved in ATR. This article presents a novel approach to termhood measurement for mono-word terms via corpus comparison, which quantifies the termhood of a term candidate as its rank difference in a domain and a background corpus. Our ATR experiments to identify legal terms in Hong Kong (HK) legal texts with the British National Corpus (BNC) as background corpus provide evidence to confirm the validity and effectiveness of this approach. Without any prior knowledge and ad hoc heuristics, it achieves a precision of 97.0% on the top 1000 candidates and a precision of 96.1% on the top 10% candidates that are most highly ranked by the termhood measure, illustrating a state-of-the-art performance on mono-word ATR in the field.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Measuring mono-word termhood by rank difference via corpus comparison

Abstract

Talk to us

Similar Papers

More From: Terminology / International Journal of Theoretical and Applied Issues in Specialized Communication

Lead the way for us

Journal: Terminology / International Journal of Theoretical and Applied Issues in Specialized Communication	Publication Date: Dec 12, 2008
Citations: 46

Similar Papers

The contribution of verbal semantic content towards term recognition
Eugenia Eumeridou
International Journal of Corpus Linguistics | VOL. 7
Eugenia EumeridouEugenia Eumeridou
18 Oct 2002
International Journal of Corpus Linguistics | VOL. 7

Methods of automatic term recognition
Kyo Kageura ... Bin Umino
Terminology / International Journal of Theoretical and Applied Issues in Specialized Communication | VOL. 3
Kyo Kageura, et. al.Kyo Kageura ... Bin Umino
01 Jan 1996
Terminology / International Journal of Theoretical and Applied Issues in Specialized Communication | VOL. 3

Legalese as Seen Through the Lens of Corpus Linguistics. An Introduction to Software Tools for Terminological Analysis

-

13 Aug 2017
13 Aug 2017

Automatic recognition of chinese scientific and technological terms using integrated linguistic knowledge
Zhifang Sui ... Zhouchao Wei
-
Zhifang Sui, et. al. Zhifang Sui ... Zhouchao Wei
26 Oct 2003
26 Oct 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Measuring mono-word termhood by rank difference via corpus comparison

Abstract

Talk to us

Similar Papers

More From: Terminology / International Journal of Theoretical and Applied Issues in Specialized Communication