Accurate tool wear prediction during machining is crucial to manufacturing since it will significantly influence tool life, machining efficiency, and workpiece quality. Although existing data-driven methods can achieve competitive performance in tool wear prediction, their main emphasis is on fixed operating conditions with sufficient training samples, which is impractical in engineering practice. This implies that predicting tool wear values under variable working conditions with insufficient data is still a challenge owing to the difference in data distributions in complex tool wear mechanisms. Besides, having no access to samples in new conditions is another challenge for tool wear prediction in engineering practice. To address these issues, we develop a hybrid information model-agnostic domain generalization (H-MADG) method to provide appropriate initial model parameters that can be fast adaptative to the new conditions after fine-tuning. Additionally, we construct hybrid information as model input by fusing process information with temporal properties derived by neural networks, and the hybrid information can offer more useful prior knowledge about the machining process. Experimental results on NASA milling data show that compared with contrastive techniques, the RMSE of the proposed H-MADG method is reduced by an average of 36.81 %, which can achieve a low average RMSE value of 0.0904 with 15 cases under five different network architectures. We also investigate several crucial impact factors of the H-MADG method and summarize corresponding analysis and suggestions.