In industrial applications, it is difficult to extract the fault feature directly when the rolling bearing works under strong background noise. In addition, single-channel vibration sensor data pose limitations in providing a comprehensive representation of bearing fault features; how to effectively fuse data of each channel and extract features is a challenge. To solve the above-mentioned problems, a fault diagnosis method based on wavelet adaptive threshold filtering and multi-channel fusion cross-attention neural network is proposed in this paper. First, the multi-scale discrete wavelet transform is applied to obtain the wavelet coefficients of each channel. Adaptive threshold filtering is conducted to filter out noise and extract symbolic features. The threshold updates with the training of the network. Then, the wavelet coefficients are reconstructed and the channel attention is performed to further extract the symbolic features of the fault signal. Finally, the multi-channel fault signals are fused by a cross-attention module. This module can fully extract the features of each channel and fuse multi-channel data. To improve the generalization ability of the network, residual connections are added. To verify the effectiveness of the proposed method, experiments are carried out on the rolling bearing datasets of Case Western Reserve University and Xi’an Jiaotong University. In addition, the gas turbine main bearing dataset is also applied to prove the reliability of this method.