Abstract

An urban traffic light scheduling problem (UTLSP) is studied by using problem feature based meta-heuristics with Q-learning. The goal is to minimize the network-wise total delay time within a time window by finding a high-quality schedule of traffic lights. First, a dynamic flow model is used to describe the UTLSP in a scheduling framework. Second, four improved meta-heuristics combining Q-learning are proposed, including harmony search (HS), water cycle algorithm (WCA), Jaya, and artificial bee colony (ABC) algorithms. Five problem feature based local search operators are constructed. During the iterative process, Q-learning is employed to select the local search operators with strong competitiveness. Two ensemble strategies are proposed to combine meta-heuristics and Q-learning. Finally, experiments are conducted based on real traffic data. The performance of the improved meta-heuristics with Q-learning is verified by solving eighteen cases with different scales. Numerical results and comparisons show that the proposed algorithms have statistical improvements over their peers. The proposed feature-based ABC with Q-learning has the strongest competitiveness among all compared ones.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.