Tool wear is critically important for the optimization of cutting parameters. However, the increasing nature of tool wear presents challenges to traditional meta-heuristic cutting parameter optimization methods. To address this issue, we propose an innovative deep reinforcement learning-driven cutting parameters adaptive optimization method taking tool wear into account. More specifically, we use the Markov Decision Process to simulate the optimization process of cutting parameters. Firstly, an innovative deep transfer learning algorithm is used for monitoring tool wear. With the progress of tool wear, the proximal policy optimization method of the transformer with multi-head attention mechanism interacts with the processing environment through a process of trial and error, and accumulates a wealth of experience in selecting cutting parameters through the reward function. The deep reinforcement learning model has quickly discern the best cutting parameters, relying on real-time tool wear value. The experimental results show that the proposed method outperforms other algorithms.