ABSTRACT Recently, U-Net architecture has been extensively explored for remote sensing (RS) image haze removal, demonstrating remarkable performance. However, most existing RS image haze removal methods based on the U-Net fail to fully utilize feature information extracted in the encoding layers during decoding, resulting in unsatisfactory haze removal results. Moreover, most haze removal algorithms neglect the interaction of cross-level feature information, which is particularly important for scene recovery and context information constraints. To address these issues, this paper presents an Embedded U-shaped Network with Cross-Hierarchical Feature Adaptation Fusion Network (EUCHA). Specifically, an Embedded U-shaped Network Framework (EUNF) is proposed to fuse the features extracted from different scale encoding layers as supplementary information into the corresponding decoding layers by embedding multiple parallel U-shaped subnetworks, which alleviates feature information dilution during the decoding process and enhances the network receptive field. Furthermore, we propose a Cross-Hierarchical Feature Adaptive Fusion (CHAF) module, which effectively and adaptively fuses multiple adjacent hierarchical features extracted by channel attention and pixel attention through learnable factors to enhance the expressive ability of features. Experiments were conducted on both real and synthetic datasets and compared with the nine most advanced algorithms. The results showed that EUCHA algorithm performs better in removing artefacts in both RS images and natural images. The source code is available at https://github.com/GongHongY/EUCHA.