Rack and pinion drives (RPD) are widely used in battery swapping system (BSS) for electric heavy trucks (EHT), and due to the continuous heavy-load and high-intensity operation, along with the electric erosion, the gears in the RPD are always damaged, which causes unexpected consequences such as downtime or safety incidents. The working conditions of the RPD in BSS include uncertain noises, fluctuant and low speed, which pose steep challenges to accurate fault diagnosis. Considering the auditory resistance of interference, the low-frequency sensitivity of auditory perception, and the auditory saliency mechanism, to leverage the advantages of auditory perceptual mechanism in addressing the above challenges, as the contribution in artificial intelligence, we propose an entire vibration signal processing scheme based on auditory bionics, including some mathematical models for auditory mechanisms. For the application in engineering, the proposed scheme is employed for fault diagnosis of RPD in BSS in unique working conditions. First, adaptive resampling is used to smooth the speed fluctuation, then, Gammatone filters are employed to transform vibration signals to cochleograms, after that, based on auditory stream segregation and selective attention mechanisms, effective frequency channels and salient features are extracted from the cochleograms, besides, to improve the diagnosis accuracy, binaural features are also extracted, finally, based on (sectional) sparse representation and fusion, fault diagnosis is achieved. The effectiveness of the fault diagnosis scheme is demonstrated using a BSS prototype system.