Various physical characteristics of urban impervious surfaces (ISAs) and urban green spaces (UGSs) collectively regulate environmental temperatures through heating and cooling processes. However, current research often analyzes each regulating factor as an independent variable when examining its relationship with land surface temperature (LST), with limited studies considering the combined contribution weights of all regulating factors. Based on multi-source remote sensing data and ground observations from the near summers of 2014, 2016, 2017, and 2018 in the built-up area of Xuzhou City, numerical values and spatial distributions of 15 regulating factors, including ISA density (fi), land surface albedo (Albedo), population density (Population), anthropogenic heat flux (AHF), maximum ISA patch index (LPIISA), natural connectivity of ISA patches (COHESIONISA), aggregation index of ISA patches (AIISA), average shape index of ISA patches (SHAPE_MNISA), UGS density (fv), evapotranspiration (ET), UGS shading index (UGSSI), maximum UGS patch index (LPIUGS), natural connectivity of UGS patches (COHESIONUGS), aggregation index of UGS patches (AIUGS), and average shape index of UGS patches (SHAPE_MNUGS), were separately extracted within the study area. Using geographically weighted regression models and bivariate spatial autocorrelation models, we separately obtained the quantitative and spatial correlations between the 15 regulating factors and LST. The results revealed that all selected regulating factors exhibited high goodness-of-fit and significant spatial correlations with LST, which led to their categorization into eight warming factors and seven cooling factors. The factor detection of the Geographic Detector further reveals the combined contribution of all regulating factors to LST. The results indicate that cooling factors collectively have higher explanatory power for LST compared to warming factors, with UGSSI contributing the most to LST, while Population contributed the least. Furthermore, the interaction detection results of the Geographic Detector have highlighted variations in the explanatory power of different factor combinations on LST. Ultimately, it has identified factor combinations that have proven to be most effective in mitigating the urban heat environment across three scenarios: warming factors alone, cooling factors alone, and a combination of both warming and cooling factors. The suggested factor combinations are as follows: fi ∩ Albedo, fi ∩ LPIISA, UGSSI ∩ fv, UGSSI ∩ LPIUGS, fi ∩ UGSSI, and Albedo ∩ UGSSI. Therefore, our findings hold the potential to provide a valuable reference for urban planning and climate governance. Tailoring factor combinations to the local context and selecting the most effective ones can enable cost-effective mitigation of the urban heat environment.