This research addresses the challenge of generalizing deep learning models for different CFRP composite structures in the task of fatigue damage detection. To overcome this challenge, knowledge distillation is employed to enhance the generalizability of deep learning models. A teacher network processes continuous wavelet transform images using Fourier transform and neural networks, while a student network distills the teacher network. This framework improves the models' generalization performance by transferring knowledge from the teacher network to the student network. Additionally, soft gradient boosting is utilized to further enhance the generalizability. By constructing a main sub-network and multiple parallel auxiliary sub-networks within the teacher network, the student network mimics the main sub-network to achieve improved accuracy in the target domain and prevent overfitting. To augment limited datasets of real CFRP monitoring signals and help to learn domain-invariant features, structural digital twin technology is leveraged to generate simulated monitoring signals, which enables the models to capture domain invariant information, significantly enhancing its performance of fatigue damage detection across different structures. Damage detection based on the generalization results between multiple Layups demonstrates a test accuracy exceeding 80 % when the monitoring data of the target CFRP structure is unavailable during training. Therefore, the cross-structure damage detection ability of the proposed approach is well proved.