Recent load forecasting techniques combining machine learning models and hyperparameter optimization algorithms have shown success for short-term load forecasting (STLF) task, but they often require complex programming, higher computational costs, and greater parameter tuning. In this paper, we introduce an improved STLF model that combines Long Short-Term Memory (LSTM) neural network with Particle Swarm Optimization (PSO) for enhanced performance. In the proposed approach, the number of hidden neurons in different LSTM layers, learning rate and the number of iterations for training are optimized using the PSO algorithm. To validate the effectiveness of this method, meteorological data and historical load data from a real-world power grid are used as input. The experimental results reveal PSO significantly enhances hyperparameter tuning for LSTM neural networks, leading to improved predictive modelling. The PSO-LSTM model performed better than the LSTM model by more than 20% (in terms of Mean Absolute Error), and showed low sensitivity to hyperparameters. Comparative analysis with alternative approaches from the literature further validates the PSO-LSTM’s effectiveness in STLF. Additionally, the model achieved stable multi-step prediction capabilities, with average errors of 3.6445 for MAE, 4.6509 for RMSE, and 4.6519 for MAPE over a 1–4 day ahead lead times. This study highlights PSO-LSTM’s enhanced robustness and accuracy in power load prediction while addressing hyperparameter tuning challenges through self-optimization.