A conceptual density‐based approach for the disambiguation of toponyms

Davide Buscaldi,Paulo Rosso

doi:10.1080/13658810701626251

Abstract

Nowadays, a huge quantity of information is stored in digital format. A great portion of this information is constituted by textual and unstructured documents, where geographical references are usually given by means of place names. A common problem with textual information retrieval is represented by polysemous words, that is, words can have more than one sense. This problem is present also in the geographical domain: place names may refer to different locations in the world. In this paper we investigate the use of our word sense disambiguation technique in the geographical domain, with the aim of resolving ambiguous place names. Our technique is based on WordNet conceptual density. Due to the lack of a reference corpus tagged with WordNet senses, we carried out the experiments over a set of 1,210 place names extracted from the SemCor corpus that we named GeoSemCor and made publicly available. We compared our method with the most‐frequent baseline and the enhanced‐Lesk method, which previously has not been tested in large contexts. The results show that a better precision can be achieved by using a small context (phrase level), whereas a greater coverage can be obtained by using large contexts (document level). The proposed method should be tested with other corpora, due to the fact that our experiments evidenced the excessive bias towards the most‐frequent sense of the GeoSemCor.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A conceptual density‐based approach for the disambiguation of toponyms

Abstract

Talk to us

Similar Papers

More From: International Journal of Geographical Information Science

Lead the way for us

Journal: International Journal of Geographical Information Science	Publication Date: Mar 1, 2008
Citations: 105

Similar Papers

A Spatially-Aware Data-Driven Approach to Automatically Geocoding Non-Gazetteer Place Names
Praval Sharma ... Leen-Kiat Soh
ACM Transactions on Spatial Algorithms and Systems | VOL. 10
Praval Sharma, et. al.Praval Sharma ... Leen-Kiat Soh
11 Dec 2023
ACM Transactions on Spatial Algorithms and Systems | VOL. 10

Translation of Personal and Place Names from and into Chinese in Modern China: A Lexicographical History Perspective
Wensheng Qu ... Run Li
International Journal for the Semiotics of Law - Revue internationale de Sémiotique juridique | VOL. 28
Wensheng Qu, et. al.Wensheng Qu ... Run Li
03 Mar 2015
International Journal for the Semiotics of Law - Revue internationale de Sémiotique juridique | VOL. 28

Representing Semantics of Text by Acquiring its Canonical Form
Mohammed Ahmed Taiye ... Siti Sakira Kamaruddin
International Journal on Advanced Science, Engineering and Information Technology | VOL. 7
Mohammed Ahmed Taiye, et. al.Mohammed Ahmed Taiye ... Siti Sakira Kamaruddin
15 Jun 2017
International Journal on Advanced Science, Engineering and Information Technology | VOL. 7

До питання інтерпретації деяких географічних назв у курсі країнознавства Великобританії
...
-
, et. al. ...
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A conceptual density‐based approach for the disambiguation of toponyms

Abstract

Talk to us

Similar Papers

More From: International Journal of Geographical Information Science