As one of the most neglected zoonotic diseases, brucellosis has posed a serious threat to public health worldwide. This study is purposed to apply different machine learning models to improve the prediction accuracy of human brucellosis (HB) in Shaanxi, China from 2008 to 2020, under livestock husbandry intensification from a spatiotemporal perspective. We quantitatively evaluated the performance and suitability of ConvLSTM, RF, and LSTM models in epidemic forecasting, and investigated the spatial heterogeneity of how different factors drive the occurrence and transmission of HB in distinct sub-regions by using Kernel Density Analysis and Shapley Additional Explanations. Our findings demonstrated that ConvLSTM network yielded the best predictive performance with the lowest average RMSE of 13.875 and MAE values of 18.393. RF model generated an underestimated outcome while LSTM model had an overestimated one. In addition, climatic conditions, intensification of livestock keeping and socioeconomic status were identified as the dominant factors that drive the occurrence of HB in Shaanbei Plateau, Guanzhong Plain, and Shaannan Region, respectively. This work provided a comprehensive understanding of the potential risk of HB epidemics in Northwest China driven by both anthropogenic activities and natural environment, which can support further practice in disease control and prevention.