Environmental noise and transmission components can cause significant interference in vibration signals, rendering the extraction of bearing fault features challenging in service scenarios. Traditional fault diagnosis methods rely heavily on professional domain knowledge, prior models, and signal preprocessing methods. The accuracy of fault diagnosis depends on the quality of the fault-sensitive features extracted by vibration signal preprocessing methods. An improved convolutional neural network (CNN) end-to-end intelligent fault diagnosis model based on raw vibration data (RVDCNN) is proposed. The time-domain vibration signal of the transmission bearing is converted into a continuous two-dimensional numerical matrix, and a two-dimensional CNN model is constructed through network structure optimization. The original time-domain vibration signal numerical matrix of the bearing is trained and tested to extract and learn abstract fault features of different fault types, and then the fault classification of the bearing is achieved. To verify the generalizability of the RVDCNN intelligent fault diagnosis model, it is applied to the recognition of rolling bearings in the two-speed mechanical automatic transmission of electric vehicles, achieving recognition accuracy of 99.11% for seven types of bearings.