Deep transfer learning is frequently employed to address the challenges arising from limited or hard-to-obtain training data in the target domain, but its application in axial compressors has been scarcely explored thus far. In this paper, a multi-objective optimization framework of a transonic rotor is established using deep transfer learning. This framework first pre-trains deep neural networks based on the peak efficiency condition of 100% design speed and then fine-tunes the networks to predict the performance of off-design conditions based on the small training dataset. Finally, the design optimization of the transonic rotor is carried out through non-dominated sorting genetic algorithm II. Compared to neural networks that are trained directly, transfer learning models can achieve higher prediction accuracy, particularly in scenarios with small training datasets. This is because the pre-trained weights can offer a better initial state for transfer learning models. Moreover, transfer learning models can use fewer samples to obtain an approximate Pareto front, making the optimized rotor increase the isentropic efficiency at both peak efficiency and high loading conditions. The efficiency improvement of the optimized rotor is attributed to the reduction of the loss associated with the tip leakage flow by adjusting the tip loading distribution. Overall, this study fully demonstrates the effectiveness of transfer learning in predicting compressor performance, which provides a promising approach to solving high-cost compressor design problems.