Abstract Rotating machinery is a crucial component of industrial equipment, and the fault diagnosis of bearings and gears, as vital elements of rotating machinery, is essential since they often fail under harsh working conditions, leading to significant property losses and serious personal safety problems. However, fault data for gears and bearings are often sparse in actual condition, and it is a challenge to ensure the reliability and stability of fault diagnosis results by extracting the features of a single data. To solve the above problems, this paper proposes a fault diagnosis method that combines Transfer Learning and data fusion techniques. Firstly, in this method, two kinds of fault signals are transformed into Gramian Angular Difference Fields and Recurrence Plot. Next, a U-shaped feature fusion dual discriminator generative adversarial network is used to fuse two-dimensional images from multiple sensor data. Its feature fusion module deeply integrates the features of the two images, thereby solving the impact of single data on the reliability and stability of fault diagnosis. Moreover, open-source datasets are used for Transfer Learning training to tackle the small sample problem. Finally, a decision-level information fusion classifier, the Dual-Branch Dempster-Shafer Classifier (DB-DSC), classifies the fused images. This classifier incorporates an improved soft threshold function and D-S evidence theory to achieve adaptive gradient changes and improve the robustness and accuracy of classification results. The experimental results show the effectiveness and stability of the proposed method, and the generated images get high score in several metrics. The average classification accuracy of the classification network reaches 93% and 92.5% on the two datasets, Therefore, the proposed method exhibits strong fault diagnosis capabilities under the small sample conditions of bearings and gears.