Comparison of Twelve Machine Learning Regression Methods for Spatial Decomposition of Demographic Data Using Multisource Geospatial Data: An Experiment in Guangzhou City, China

Guanwei Zhao,Zhitao Li,Muzhuang Yang

doi:10.3390/app11209424

Abstract

The spatial decomposition of demographic data at a fine resolution is a classic and crucial problem in the field of geographical information science. The main objective of this study was to compare twelve well-known machine learning regression algorithms for the spatial decomposition of demographic data with multisource geospatial data. Grid search and cross-validation methods were used to ensure that the optimal model parameters were obtained. The results showed that all the global regression algorithms used in the study exhibited acceptable results, besides the ordinary least squares (OLS) algorithm. In addition, the regularization method and the subsetting method were both useful for alleviating overfitting in the OLS model, and the former was better than the latter. The more competitive performance of the nonlinear regression algorithms than the linear regression algorithms implies that the relationship between population density and influence factors is likely to be non-linear. Among the global regression algorithms used in the study, the best results were achieved by the k-nearest neighbors (KNN) regression algorithm. In addition, it was found that multi-sources geospatial data can improve the accuracy of spatial decomposition results significantly, and thus the proposed method in our study can be applied to the study of spatial decomposition in other areas.

Highlights

Information about fine-scale population distribution is essential in many areas, including urban planning and management [1], natural disaster response [2], infectious disease prevention and control [3], resource allocation, and environment protection [4]
Since the nonlinear regression model can deal better with the collinearity of independent variables and other problems that lead to overfitting, we suggest that when conducting research on the spatial decomposition of demographic data, priority should be given to using nonlinear regression models to improve the accuracy of results
This paper compares the use of twelve machine learning regression algorithms in gridded population mapping of Guangzhou city, China

Summary

Introduction

Information about fine-scale population distribution is essential in many areas, including urban planning and management [1], natural disaster response [2], infectious disease prevention and control [3], resource allocation, and environment protection [4]. Accurate population distribution data are fundamental for the achievement of urban sustainable development goals (SDGs) [5,6]. The census method is the main way to collect population data in varying countries. The spatial resolution and update frequency of census data are too low to meet the requirements of modern urban governance. Fine-scale and accurate population information is essential for exploring the relationship between urban residents and the built environment [1]

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Oct 11, 2021
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Comparison of Twelve Machine Learning Regression Methods for Spatial Decomposition of Demographic Data Using Multisource Geospatial Data: An Experiment in Guangzhou City, China

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Red herrings revisited: spatial autocorrelation and parameter estimation in geographical ecology
Bradford A Hawkins ... Luis Mauricio Bini
Ecography | VOL. 30
Bradford A Hawkins, et. al.Bradford A Hawkins ... Luis Mauricio Bini
01 Jun 2007
Ecography | VOL. 30

Geographically Weighted Regression Effects on Soil Zinc Content Hyperspectral Modeling by Applying the Fractional-Order Differential
Xue Lin ... Jinming Sha
Remote Sensing | VOL. 11
Xue Lin, et. al.Xue Lin ... Jinming Sha
15 Mar 2019
Remote Sensing | VOL. 11

Potential Contribution of Residuals for Better Prediction of Soil Salinity from Remote Sensing Data
...
-
, et. al. ...
01 Jan 2006
01 Jan 2006

The association of county-level socioeconomic factors with individual tobacco and alcohol use: a longitudinal study of U.S. adults
Rita Hamad ... Daniel M Brown
BMC Public Health | VOL. 19
Rita Hamad, et. al.Rita Hamad ... Daniel M Brown
11 Apr 2019
BMC Public Health | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of Twelve Machine Learning Regression Methods for Spatial Decomposition of Demographic Data Using Multisource Geospatial Data: An Experiment in Guangzhou City, China

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences