A WSD Model for Corpus Construction

Weiguang Qu,Xiaojun Zhang,Shiwen Yu,Sujian Li,Yunfang Wu,Zhifang Sui

doi:10.1142/s1793840608001743

A WSD Model for Corpus Construction

Weiguang Qu, Xiaojun Zhang + Show 4 more

https://doi.org/10.1142/s1793840608001743

Copy DOI

Journal: International Journal of Computer Processing of Languages

Publication Date: Mar 1, 2008

Affiliation: Shaanxi Normal University, Peking University, Nanjing Normal University

#Annotation Of Large Corpora #WSD Approaches + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

A common difficulty in the annotation of large corpora is the disambiguation of the meanings of polysemous words, a task which currently depends on unassisted manual checking by human experts. In this paper, we propose RFR-SUM, an approach to the automatic disambiguation of polysemous Chinese words that calculates and then sums the collocational strengths of words in the local context of a given word. The approach is efficient in that it learns from a small annotated corpus. It can also be used to support manual checking by automatically identifying sentences where usages are borderline and require a human decision. Testing has shown that RFR-SUM outperforms two other commonly used WSD approaches.

Full Text