redict the sound quality of dual-phase Hy-Vo chain transmission system noise using a small sample size. Noise acquisition tests are conducted under various working conditions, followed by subjective evaluations using the equal interval direct one-dimensional method. Objective evaluations are performed using the Mel-frequency cepstral coefficient (MFCC). To understand the impact of the MFCC order and the frame number on prediction accuracy, MFCC feature maps of different specifications are analyzed. The dataset is expanded threefold using fuzzy generation with an appropriate membership degree. The convolutional neural network (CNN) is developed, utilizing MFCC feature maps as inputs and evaluation scores as outputs. Results indicate a positive correlation between the frame number and prediction accuracy, whereas higher MFCC orders introduce redundancy, reducing accuracy. The proposed CNN method outperforms three traditional machine learning approaches, demonstrating superior accuracy and resistance to overfitting.