Forecasting wind power is crucial to ensuring the electricity system runs steadily. However, wind power forecasting is hampered by the randomness and volatility of wind power data. In this paper, a deterministic and probabilistic decomposition ensemble model based on improved feature screening and optimal Gaussian mixed kernel function is proposed, which mainly includes feature screening, feature learning, intelligent weighted integration and interval prediction modules. First, an improved feature screening (IFS) is proposed to capture the optimal features of the original wind power data, eliminating the interference of redundant features. Then an optimized deep neural network is suggested to learn the outcomes of IFS and further explore the temporal correlation of these features. Then, considering the difference of the contribution of each feature component, the final result is obtained by integrating the preliminary forecast results using intelligent weighted integration (IWI). Finally, an optimal mixed kernel-based Gaussian process regression model is developed for interval prediction, which improves the adaptability of the model. The data sets of two wind farms in Inner Mongolia were used for empirical testing. On the two datasets, the model's root mean square error (RMSE) is 1.1845 and 1.7354, respectively, both of which are lower than the comparison model. The model has the lowest coverage width criterion (CWC) of 0.1499 and 0.1460 on the two datasets, respectively.