An accurate forecast of short-term reference evapotranspiration (ET0) is crucial for effective farm and irrigation scheduling management. The techniques of forecasting ET0 have progressed from empirical equations to artificial intelligence-based models. However, gaps have remained in comprehensively evaluating the ET0 forecast performance discrepancies between various methods based on weather forecasts in addition to historical meteorological data, as well as investigating the effect of the amount of input variables on ET0 forecast accuracy. Here, the study evaluated the forecast accuracy of five conventional equations calibrated and six machine learning (ML) and deep learning (DL) models in predicting daily ET0 for a lead time of 1–7 days in the North China Plain (NCP). Comparative evaluations indicate that the Analytical Penman-Monteith (APM) is the optimum empirical equation for forecasting ET0, but its accuracy decreases as lead times increase. Conversely, ML and DL models consistently outperform empirical equations, characterized by lower mean absolute error (MAE) and root mean square error (RMSE) as well as correlation coefficient and Nash-Sutcliffe efficiency coefficient approaching 1 across all lead times. Notably, the Bidirectional Long short-term memory (Bi-LSTM) model outperforms other models, maintaining robust and effective forecast performance even on day 7. Moreover, with the increasing input variables, combining weather forecasts and historical meteorological data, the forecast performance of all models is improved, with significant enhancement observed in the Bi-LSTM model. This work provides valuable insights for choosing appropriate models to forecast ET0, holding essential in regional managing farm and irrigation systems.