The environmental-friendly production demands higher manufacturing efficiency and lower energy cost; therefore, time-of-use electricity price constraint and distributed production have attracted more attention. For concrete precast architecture construction, the tardiness penalty and warehouse cost cannot be ignored, which should be optimized for lower cost. In this study, the concrete precast process is investigated as the group scheduling of a distributed flexible job shop problem. Each concrete precast is managed in a group, the setup time between different groups is considered. Two objectives, total weighted earliness and tardiness and total time-of-use electricity cost are minimized, simultaneously. To solve the integrated problem, three coordinated double deep Q-networks (DQN) are applied, which are organized as a learn-to-improve reinforcement learning approach. For distributed scheduling problem, operators in a single factory or in multiple factories differs in solution improvement; so selection DQN is designed to decide the type of the operators according to scheduling circumstances. Other two DQNs, i.e., local DQN and global DQN, are to select the optimization operators in one factory or in multiple factories, respectively. Furthermore, two solution refinement strategies are designed to decrease the objectives after reinforcement learning component. Numerical experiment and statistical analysis suggest that the proposed deep reinforcement learning based algorithm has superiority in solving the considered problem.
Read full abstract