In this study, an in-depth analysis is presented on forecasting aggregated wind power production at the regional level, using advanced Machine-Learning (ML) techniques and feature-selection methods. The main problem consists of selecting the wind speed measuring points within a large region, as the wind plant locations are assumed to be unknown. For this purpose, the main cities (province capitals) are considered as possible features and four feature-selection methods are explored: Pearson correlation, Spearman correlation, mutual information, and Chi-squared test with Fisher score. The results demonstrate that proper feature selection significantly improves prediction performance, particularly when dealing with high-dimensional data and regional forecasting challenges. Additionally, the performance of five prominent machine-learning models is analyzed: Long Short-Term Memory (LSTM) networks, Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), Convolutional Neural Networks (CNNs), and Extreme-Learning Machines (ELMs). Through rigorous testing, LSTM is identified as the most effective model for the case study in northern Italy. This study offers valuable insights into optimizing wind power forecasting models and underscores the importance of feature selection in achieving reliable and accurate predictions.