Short-term water demand forecasting (STWDF) for multiple spatially and temporally correlated District Metering Areas (DMAs) is an essential foundation for achieving more refined management of urban water supply networks. However, due to the greater uncertainty associated with specific DMA demand compared to overall water usage, accurately predicting STWDF poses significant challenges. This study introduces an innovative network architecture—the multi-scale correction module neural network, built upon Long Short-Term Memory (LSTM) networks and Convolutional Neural Networks (CNN) enhanced with Attention mechanisms—for simultaneous STWDF with a temporal resolution of one hour over a week for 10 DMAs located in a single city in northern Italy. This framework utilizes multivariate corrections to refine and enhance the output accuracy. The results reveal that, in comparison to traditional Gated Recurrent Unit or LSTM models, the proposed model with integrated correction modules, particularly those that leverage inter-DMA correlations, improves performance across all evaluation metrics by an average of 5%-20% per DMA. Additionally, it consistently delivers superior accuracy across three scenarios: single DMA forecasting, total water demand, and extreme conditions, while maintaining stable performance throughout. Furthermore, the interpretability analysis underscores the feasibility of this innovative structure and highlights the contribution of meteorological features to the predictive model in some DMA-level STWDF. The unified input-output framework elegantly simplifies the STWDF process across multiple DMAs, providing new insights and methodologies for future research in this domain.