To retrospect the impact of urbanization on vegetation, consistent and reliable maps that provide urban vegetation estimates are required. In this study, we make the first attempt to employ deep learning for urban vegetation mapping of 2120 cities in China using a 30-years Landsat archive (UV-30). We present a multimodal deep learning (MDL) model to combine features from 5 categories, namely reflective bands, remote sensing indices, texture variables, topographical variables, and time labels. We normalized elevation data locally to remove the inter-city differences, while retaining topographic details. We used time labels to account for spectral variability over time and Landsat sensors. The performance of the MDL models was compared with a vanilla deep neural network, random forest, and support vector regression. Results indicated that the fusion of different categories of features improved the prediction accuracy. The MDL performed best in most validation cities, and our results were superior to the fractional maps aggregated from high-resolution datasets in homogeneous urban landscapes. We further analyzed urban vegetation dynamics across the urban core-new town-outskirt gradients of different geographical regions and city sizes. We found a gradually diminishing de-greening trend in urban China, especially in urban cores and new towns. Nevertheless, we observed a more pronounced degradation of urban vegetation in the south regions of China compared to the north, and smaller cities show no evident urban vegetation increase. The suggested framework is valuable for urban vegetation assessments and facilitates a thorough understanding and management of urban ecosystems.
Read full abstract