Accurate forecast of fine particulate matter (PM2.5) is crucial for city air pollution control, yet remains challenging due to the complex urban atmospheric chemical and physical processes. Recently deep learning has been routinely applied for better urban PM2.5 forecasts. However, their capacity to represent the spatiotemporal urban atmospheric processes remains underexplored, especially compared with traditional approaches such as chemistry-transport models (CTMs) and shallow statistical methods other than deep learning. Here we probe such urban-scale representation capacity of a spatiotemporal deep learning (STDL) model for 24-hour short-term PM2.5 forecasts at six urban stations in Rizhao, a coastal city in China. Compared with two operational CTMs and three statistical models, the STDL model shows its superiority with improvements in all five evaluation metrics, notably in root mean square error (RMSE) for forecasts at lead times within 12 h with reductions of 49.8 % and 47.8 % respectively. This demonstrates the STDL model's capacity to represent nonlinear small-scale phenomena such as street-level emissions and urban meteorology that are in general not well represented in either CTMs or shallow statistical models. This gain of small-scale representation in forecast performance decreases at increasing lead times, leading to similar RMSEs to the statistical methods (linear shallow representations) at about 12 h and to the CTMs (mesoscale representations) at 24 h. The STDL model performs especially well in winter, when complex urban physical and chemical processes dominate the frequent severe air pollution, and in moisture conditions fostering hygroscopic growth of particles. The DL-based PM2.5 forecasts align with observed trends under various humidity and wind conditions. Such investigation into the potential and limitations of deep learning representation for urban PM2.5 forecasting could hopefully inspire further fusion of distinct representations from CTMs and deep networks to break the conventional limits of short-term PM2.5 forecasts.