Abstract

Designing an effective and efficient end-to-end optimization framework with good generalization for shop scheduling is an emerging topic in the informational manufacturing system. Existing end-to-end frameworks have achieved satisfactory results for COPs such as traveling salesman problem and vehicle routing problem. However, the performances of these methods in solving complex COPs such as shop scheduling need to be improved. In this paper, a knowledge-guided end-to-end optimization framework based on reinforcement learning is proposed to solve the permutation flow shop scheduling problem (PFSP). Firstly, a new policy network is designed based on the problem characteristics to deal with different scales of PFSPs and achieve iterative end-to-end generation. Secondly, an improved policy-based reinforcement learning algorithm by using the knowledge accumulated during the training process is designed to enhance the training quality. Thirdly, a knowledge-guided improvement strategy is introduced through the cooperation of local search and supervised learning to improve the learning of the policy. Simulation results and comparisons show that the knowledge-guided end-to-end optimization framework can obtain better results than different kinds of commonly-used optimization methods in limited computation time for solving the PFSP.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call