Accurate short-term wind power prediction can help grid managers optimize the transmission and storage of electricity from wind farms, avoiding energy wastage and unnecessary operating costs, and ensuring the stability and reliability of power supply. However, stochastic nature and uncertainty data pose challenges for accurate wind power prediction models. This study combines complementary ensemble empirical mode decomposition (CEEMD), discrete wavelet transform (DWT), bidirectional long short-term memory (BiLSTM), InceptionV3, Transformer, and fully connected neural networks for wind power prediction. The proposed prediction model can fully capture the nonlinear and dynamic characteristics of the wind speed data that are decomposed into intrinsic modal functions by CEEMD. Simultaneously, the prediction model can capture the local characteristics and cyclic variations of the data at different time scales based on fifth-order DWT. A dataset spanning 378 days from Natal is employed and the proposed model is compared with 19 other networks (e.g., ResNet, GoogleNet, SqueezeNet) and 15 other traditional regression algorithms (e.g., Gaussian process regression, least-angle regression, and nearest neighbors regression). The experimental results show that the designed BiLSTM-InceptionV3-Transformer-fully-connected model (BITFM) can reduce the mean absolute error by at least 38.75 % and the root mean squared error by at least 33.33 % compared with 19 other networks. Compared with 15 traditional regression prediction models, BITFM can reduce the mean absolute error by at least 57.41 % and the root mean squared error by at least 53.95 %. To further validate the prediction performance of BITFM, this study conducts tests employing data from the Santa Vitoria do Palmar. The experimental results indicate that BITFM consistently achieved the best performance metrics.