Abstract

Urban areas are essential to daily human life; however, the urbanization process also brings about problems, especially in China. Urban mapping at large scales relies heavily on remote sensing (RS) data, which cannot capture socioeconomic features well. Geolocation datasets contain patterns of human movement, which are closely related to the extent of urbanization. However, the integration of RS and geolocation data for urban mapping is performed mostly at the city level or finer scales due to the limitations of geolocation datasets. Tencent provides a large-scale location request density (LRD) dataset with a finer temporal resolution, and makes large-scale urban mapping possible. The objective of this study is to combine multi-source features from RS and geolocation datasets to extract information on urban areas at large scales, including night-time lights, vegetation cover, land surface temperature, population density, LRD, accessibility, and road networks. The random forest (RF) classifier is introduced to deal with these high-dimension features on a 0.01 degree grid. High spatial resolution land cover (LC) products and the normalized difference built-up index from Landsat are used to label all of the samples. The RF prediction results are evaluated using validation samples and compared with LC products for four typical cities. The results show that night-time lights and LRD features contributed the most to the urban prediction results. A total of 176,266 km2 of urban areas in China were extracted using the RF classifier, with an overall accuracy of 90.79% and a kappa coefficient of 0.790. Compared with existing LC products, our results are more consistent with the manually interpreted urban boundaries in the four selected cities. Our results reveal the potential of Tencent LRD data for the extraction of large-scale urban areas, and the reliability of the RF classifier based on a combination of RS and geolocation data.

Highlights

  • Urban areas account for a small proportion of global land cover, but support daily human life and exert a great influence on environmental and ecological changes [1,2]

  • Several general problems need to be addressed in the construction of a large-scale random forest (RF) classifier, such as parameter settings and high-quality sample labeling. In light of these problems, the objective of this study is to propose a framework for combining Remote sensing (RS) and geolocation features from multiple sources and extracting information on large-scale urban areas using an RF classifier

  • The random forest (RF) classifier was applied to extract urban areas based on six RS and eight geographical information system (GIS) features at the spatial resolution of 0.01 degree

Read more

Summary

Introduction

Urban areas account for a small proportion of global land cover, but support daily human life and exert a great influence on environmental and ecological changes [1,2]. Understating the extent and distribution of an urban area is a priority for sustainable development, ecological protection, and land use planning [3,4]. In addition to traditional diurnal imagery, night-time light (NTL) imagery has been introduced to map urban areas due to its simpler and more uniform characteristics [3,11]. Land surface temperature (LST) and vegetation index (VI) features from diurnal RS datasets are two common and quantitative land cover (LC) variables. Urban areas lack vegetation cover and have a higher LST due to the urban heat island effect [14,15,16]. An NTL, LST, and VI features composite can help us to better delineate urban areas [17]

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call