In the automobile manufacturing process, the complexity of the production environment and the different objectives of the upstream and downstream shops necessitate the resequencing of the car sequences using buffers. This paper addresses a car resequencing problem in automotive assembly shops by utilising selectivity banks to reshuffle the given upstream sequences to minimise the number of violations of sequencing rules in the downstream sequence. A new resequencing approach combining heuristics and deep reinforcement learning is proposed to solve the problem. Heuristics solve the automobile storage problem, while a deep reinforcement learning method based on proximal policy optimisation addresses the automobile release problem. Three state matrices, agent actions, and the reward function are designed based on the characteristics of the objective and selectivity banks. Additionally, a novel network structure combining convolutional neural networks and long short-term memory networks is proposed to train the agent. The results demonstrate that the proposed method outperforms two efficient deep reinforcement learning algorithms and two heuristics and obtains better solutions in handling dynamic events.
Read full abstract