Abstract

Distributed three-stage assembly scheduling problem extensively exists in the real-life assembly production process and is seldom considered. The integration of reinforcement learning with meta-heuristic can effectively improve the performance of meta-heuristic and effectively solve the problem; however, the integration is seldom used to cope with the problem. In this study, distributed three-stage assembly scheduling problem with DPm→1 layout and maintenance at three stages is considered and a mathematical model is provided. A new artificial bee colony with Q-learning (QABC) is proposed to minimize maximum tardiness. An effective Q-learning algorithm is implemented to dynamically select search operator, which consists of 12 states based on population quality evaluation, 8 actions defined by global search and neighborhood search, a new reward and an effective action selection. Two employed bee swarms are formed, an adaptive communication and an adaptive competition process between them are adopted to intensify exploration ability and improve search efficiency. QABC and its four comparative algorithms are tested on 80 instances. The computational results demonstrate that the new strategies of QABC really improve its search performance and QABC is a competitive algorithm for the considered problem.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.