Long-time discontinuous contact is easy to cause tool wear during milling. To decrease the impact of severe wear on workpiece quality and processing efficiency, cutting tools should be replaced timely. Therefore, tool wear prediction is an important aspect in improving process efficiency, ensuring machining precision and realizing intelligent manufacturing. To boost the precision of online prediction of tool wear, this paper suggests a novel approach to monitor tool wear by optimizing backpropagation (BP) neural network via firefly algorithm (FA). Specifically, the progressive semi-soft threshold function is applied to the process of cutting force signal noise reduction, which reduces redundant signals and noise interference in the signal. Time-domain analysis, frequency-domain analysis, and wavelet packet decomposition are utilized, cutting force features are extracted, and Pearson correlation coefficient is used to sift out signal features that are highly connected to tool wear. The FA is used to improve the BP neural network's weights and thresholds. Through learning nonlinear mapping, relationship between tool flank wear and signal features is realized. A prediction model of tool wear of FA-BP is constructed. Milling experiments validate the prediction model in the milling process. The experimental outcomes confirm the precision and reliability of the method. In comparison to BP neural network, genetic algorithm optimized BP neural network and particle swarm optimization algorithm optimized BP neural network prediction method, it has a greater prediction accuracy and a stronger training impact, and has superior performance. The research results can give a theoretical foundation and technological assistance for predicting tool wear, which is crucial for improving workpiece quality, processing efficiency, and promoting intelligent development.