With China’s rapid industrialization and urbanization in the process of socio-economic development, the extensive use of energy has resulted in a large amount of CO2 emissions, which puts great pressure on China’s carbon emission reduction task. Through multivariate socio-economic data, this paper proposes an extraction and screening method of multivariate variables based on land-use types, and the downscaled spatial decomposition of carbon emissions at different scales was carried out by using the spatial lag model (SLM). This paper makes up for the shortcomings of previous studies, such as an insufficient modeling scale, simple modeling variables, limited spatio-temporal span of spatial decomposition, and no consideration of geographical correlation. Based on the results of the spatial decomposition of carbon emissions, this paper explores the spatial and temporal dynamics of carbon emissions at different scales. The results showed that SLM is capable of downscaling the spatialization of carbon emissions with high precision, and the continuity of the decomposition results at the provincial scale is stronger, while the differences of the decomposition results at the municipal scale are more obvious within the municipal units. In terms of the spatial and temporal dynamics of CO2 emissions, carbon emissions at both scales showed a significant positive correlation. The dominant spatial correlation types are “Low–Low” at the provincial level, and “Low–Low” and “High–High” at the municipal level. The smaller spatial scope is more helpful to show the geographic dependence and geographic differences of China’s carbon emissions. The findings of this paper will help deepen the understanding of the spatial and temporal changes of carbon emissions in China. They will provide a scientific basis for the formulation of feasible carbon emission reduction policies.