Utilizing regional air quality models to accurately forecast surface ozone (O3) concentrations, particularly high concentrations, is essential for protecting public health. However, forecasts of air quality model often deviate from site observations due to the limitation of grid resolution and uncertainties from emission sources, meteorological conditions, and chemical reaction mechanisms. Especially, the underestimation is significant under condition of high O3 concentrations. Moreover, such deviations tend to accumulate as forecast lead time increases, compounding the challenges associated with reliable air quality forecast. In this study, we employed AlexNet architecture, a classical convolutional neural network, combined with multiple variables related to meteorology, chemistry, emission and geography to establish a non-linear relationship between grid-scale input variables and site-scale hourly O3 forecast biases in Eastern China, aiming to realize accurate city-level ozone forecast based on a regional air quality prediction model (i.e., Nested Air Quality Prediction Model System, NAQPMS). By assigning weights to high-bias samples and high-concentration samples within the loss function, the proposed Weighted AlexNet model (W_AlexNet) effectively reduced forecast biases and enhanced its capability to predict O3 pollution levels. Compared to NAQPMS, W_AlexNet model demonstrated a 25.71% improvement in RMSE and a 7.17% increase in IOA averagely for hourly O3 (O3-1h) forecasts across four different lead times (24-h, 48-h, 72-h, and 96-h). Notably, W_AlexNet model alleviated the tendency of NAQPMS to underestimate high concentrations and showed a superior performance in improving O3-1h pollution level forecasts, particularly for the 72-h and 96-h lead times. W_AlexNet model can effectively mitigate the bias accumulation effect over increasing lead times, thereby enhancing the reliability of longer-term forecasts. Thus, the W_AlexNet model serves as a post-processing model that can calibrate forecast biases in air quality prediction models, significantly improving the accuracy of O3 high concentration forecasts and providing more precise early warnings of O3 pollution. This underscores its utility in air quality management.