Abstract As one of the important feature categories in urban geographic data, buildings are the key thematic elements to be represented in large-scale urban mapping with the high speed of urban digital construction. The identification and extraction of buildings are of great significance for feature extraction, feature matching, image interpretation and mapping. However, the great variability of building size, shape, color, orientation, etc., in remote sensing images poses a great challenge to building detection. To this end, this paper proposes an algorithm based on multi-feature multi-scale fusion for the automatic extraction of buildings in remote sensing images are represented in the form of roofs. It is difficult to represent all buildings with a single feature because of the different colors, textures and shapes of building roofs. Effective features to describe buildings are proposed, including edge density and edge distribution, brightness contrast, color contrast and other features to describe building edge brightness. We propose effective features to describe buildings, including edge density and edge distribution, luminance contrast, color contrast and other underlying features to describe the edges, luminance and color of buildings, and adding special structural features such as main direction orthogonality and target integrity and symmetry to describe buildings by multiple features together. Moreover, the K-value nearest neighbor classification algorithm is used to train a series of samples, and the weights of each feature in the multi-feature model are obtained through iterative learning to obtain the multi-feature linear model and calculate the visual saliency of buildings in the sliding window; finally, the proposed algorithm has experimented with several groups of high-resolution remote sensing images respectively, and the multi-scale multi-feature fusion model algorithm is used as the Erkoff random field model to compare the algorithm. The results of this paper show that the proposed multiscale multi-feature fusion model algorithm improves by 10.82% for building classification accuracy extraction and 13.96% for feature selection extraction accuracy, and finally, the comparison from the shape optimization effect figure concludes that the multiscale multi-feature fusion model can achieve better extraction accuracy and practical effect for buildings in remote sensing images, which has certain practicality and It has certain practicality and superiority. It promotes the in-depth application of multi-feature multi-scale combined high-resolution remote sensing image-building extraction in geographic states, road traffic and other industries.t