Classifying natural-language spatial relation terms with random forest algorithm

Shihong Du,Xiaonan Wang,Chen-Chieh Feng,Xiuyuan Zhang

doi:10.1080/13658816.2016.1212356

Abstract

ABSTRACTThe exponential growth of natural language text data in social media has contributed a rich data source for geographic information. However, incorporating such data source for GIS analysis faces tremendous challenges as existing GIS data tend to be geometry based while natural language text data tend to rely on natural language spatial relation (NLSR) terms. To alleviate this problem, one critical step is to translate geometric configurations into NLSR terms, but existing methods to date (e.g. mean value or decision tree algorithm) are insufficient to obtain a precise translation. This study addresses this issue by adopting the random forest (RF) algorithm to automatically learn a robust mapping model from a large number of samples and to evaluate the importance of each variable for each NLSR term. Because the semantic similarity of the collected terms reduces the classification accuracy, different grouping schemes of NLSR terms are used, with their influences on classification results being evaluated. The experiment results demonstrate that the learned model can accurately transform geometric configurations into NLSR terms, and that recognizing different groups of terms require different sets of variables. More importantly, the results of variable importance evaluation indicate that the importance of topology types determined by the 9-intersection model is weaker than metric variables in defining NLSR terms, which contrasts to the assertion of ‘topology matters, metric refines’ in existing studies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classifying natural-language spatial relation terms with random forest algorithm

Abstract

Talk to us

Similar Papers

More From: International Journal of Geographical Information Science

Lead the way for us

Journal: International Journal of Geographical Information Science	Publication Date: Jul 25, 2016
Citations: 21

Similar Papers

Interpreting the Fuzzy Semantics of Natural-Language Spatial Relation Terms with the Fuzzy Random Forest Algorithm
Xiaonan Wang ... Xueying Zhang
ISPRS International Journal of Geo-Information | VOL. 7
Xiaonan Wang, et. al.Xiaonan Wang ... Xueying Zhang
07 Feb 2018
ISPRS International Journal of Geo-Information | VOL. 7

Ground visibility prediction using tree-based and random-forest machine learning algorithm: Comparative study based on atmospheric pollution and atmospheric boundary layer data
Fuzeng Wang ... Shujie Yuan
Atmospheric Pollution Research | VOL. 15
Fuzeng Wang, et. al.Fuzeng Wang ... Shujie Yuan
29 Jul 2024
Atmospheric Pollution Research | VOL. 15

Intelligent gravitational search random forest algorithm for fake news detection
Rathika Natarajan ... Mohammed Faez Hasan
International Journal of Modern Physics C | VOL. 33
Rathika Natarajan, et. al.Rathika Natarajan ... Mohammed Faez Hasan
31 Dec 2021
International Journal of Modern Physics C | VOL. 33

Decision Tree and Random Forest Classification Algorithms for Mangrove Forest Mapping in Sembilang National Park, Indonesia
Anang Dwi Purwanto ... Albertus Deliar
Remote Sensing | VOL. 15
Anang Dwi Purwanto, et. al.Anang Dwi Purwanto ... Albertus Deliar
21 Dec 2022
Remote Sensing | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classifying natural-language spatial relation terms with random forest algorithm

Abstract

Talk to us

Similar Papers

More From: International Journal of Geographical Information Science