Distributionally Extended Network-based Word Sense Disambiguation in Semantic Clustering of Polish Texts

Paweł Kędzia, Maciej Piasecki, Jan Kocoń, Agnieszka Indyka-Piasecka

doi:10.1016/j.ieri.2014.09.073


Open Access

https://doi.org/10.1016/j.ieri.2014.09.073


Journal: IERI Procedia	Publication Date: Jan 1, 2014
Citations: 7	License type: cc-by-nc-nd

Affiliation: Wrocław University of Science and Technology

#Princeton WordNet #Graph-based Word Sense Disambiguation


Abstract

In this paper we present an extended version of a graph-based unsupervised Word Sense Disambiguation (WSD) algorithm. The algorithm applies a spreading activation scheme to graphs built dynamically from the words of a text and a large wordnet. Originally proposed for English and Princeton WordNet, it was adapted to Polish and plWordNet. We propose an extension that draws on knowledge acquired from a corpus-derived Measure of Semantic Relatedness. The extended algorithm was evaluated against a manually disambiguated corpus. We observed an improvement when disambiguation was performed on shorter text contexts. In addition, applying the algorithm improved results in a document clustering task.
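
To make the idea of spreading activation over a sense graph more concrete, the following is a minimal Python sketch. It is not the authors' implementation: the abstract does not give the graph construction, the activation formula, or how the Measure of Semantic Relatedness is used. The toy graph, the sense labels, the function names `spread_activation` and `disambiguate`, and the `decay` and `steps` parameters are all assumptions made for illustration.

```python
from collections import defaultdict


def spread_activation(graph, seeds, decay=0.5, steps=2):
    """Propagate activation from seed nodes along graph edges.

    graph: dict mapping a node to a list of neighbouring nodes
           (hypothetical stand-in for wordnet relations).
    seeds: dict mapping a node to its initial activation.
    At each step a node passes decay * energy, split evenly, to its neighbours.
    """
    total = defaultdict(float, seeds)
    frontier = dict(seeds)
    for _ in range(steps):
        received = defaultdict(float)
        for node, energy in frontier.items():
            neighbours = graph.get(node, ())
            if not neighbours:
                continue
            share = decay * energy / len(neighbours)
            for neighbour in neighbours:
                received[neighbour] += share
        for node, energy in received.items():
            total[node] += energy
        frontier = received
    return dict(total)


def disambiguate(context, senses_of, graph, decay=0.5, steps=2):
    """Pick, for every context word, its most activated candidate sense."""
    seeds = {}
    for word in context:
        candidates = senses_of[word]
        for sense in candidates:
            # Each word spreads one unit of activation over its senses.
            seeds[sense] = seeds.get(sense, 0.0) + 1.0 / len(candidates)
    activation = spread_activation(graph, seeds, decay=decay, steps=steps)
    return {
        word: max(senses_of[word], key=lambda s: activation.get(s, 0.0))
        for word in context
    }


# Toy data (invented for this sketch, not taken from plWordNet).
graph = {
    "bank.finance": ["money.1"],
    "bank.river": ["water.1"],
    "money.1": ["bank.finance"],
    "water.1": ["bank.river"],
}
senses_of = {
    "bank": ["bank.finance", "bank.river"],
    "money": ["money.1"],
}

result = disambiguate(["bank", "money"], senses_of, graph)
print(result)
```

In this toy setting the financial sense of "bank" collects extra activation from "money" through their shared edge. One plausible reading of the extension described above is that relatedness scores from a corpus would contribute additional links to such a graph, but the abstract does not specify the mechanism.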
