BackgroundSystemic lupus erythematosus (SLE) is a chronic autoimmune disease. Currently, the medical diagnosis of SLE mainly relies on the clinical experience of physicians, and there is no universally accepted objective method for diagnosing SLE. Therefore, there is an urgent need to design an intelligent approach to accurately diagnose SLE to assist physicians in formulating appropriate treatment plans. With the rapid development of intelligent medical diagnostic technology, medical data is becoming increasingly multimodal. Multimodal data fusion can provide richer information than single-modal data, and the fusion of multiple modalities can effectively enhance the richness of data features to improve modeling performance. ResultsIn this paper, a cross-modal specific transfer fusion technique based on infrared spectra and metabolomics is proposed to effectively integrate infrared spectra and metabolomics by fully exploiting the intrinsic relationships between features across different modalities, thus achieving the diagnosis of SLE. In this research, a Decision Level Fusion module is also proposed to fuse the representations of two specific transfers further, obtaining the final prediction scores. Comprehensive experimental results demonstrate that the proposed method significantly improves the performance of SLE prediction, with accuracy and Area Under Curve (AUC) reaching 94.98 % and 97.13 %, respectively, outperforming existing methods. SignificanceOur framework effectively integrates infrared spectra and metabolomics to achieve a more accurate prediction of SLE. Our research indicates that prediction methods based on different modalities outperform those using single-modality data. The Cross-modal Specific Transfer Fusion module effectively captures the complex relationships within each single modality and models the complex relationships between different modalities.