Conventional deep learning methods for fault detection often assume that the training and the testing sets share the same fault domain spaces. However, some fault patterns are rare, and many real-world faults have not appeared in the training set. As a result, it is hard for the trained model to achieve desirable performance on the testing set. In this paper, we introduce a novel domain generalization, Load-Domain (LD) domain generalization, which is based on the analysis of the Case Western Reserve University (CWRU) bearing dataset and takes advantage of the physical information of this dataset. For this scenario, we propose a feature shift model called Feature Shift Network (FSN). FSN is trained for feature shift on adjacent source domains and finally shifts target domain features into adjacent source domain feature space to achieve the purpose of domain generalization. Furthermore, through the hybrid classification method, the generalization performance of the model on unseen target domains is effectively improved. The results on the CWRU bearing dataset demonstrate that FSN is better than the existing models in the LD domain generalization. Furthermore, we have another test on the rotated MNIST, which also shows FSN can achieve the best performance.