Traffic congestion is a major factor currently affecting the global economy thereby costing billions of dollars in terms of the time lost in traffic, productivity and fuel wastage. Although, various researchers have made concerted effort to proffer solution to the problems of traffic congestion, in addition, recent reinforcement learning research has also presented multiple possible solutions. However, based on the literature, most of the reinforcement learning traffic control agents do not possess the real-time traffic control capabilities to efficiently deal with the complex and ever-changing nature of traffic on an average urban city road. This paper presents a Real Time deep reinforcement learning model for Traffic Control for optimization of vehicle flow (minimizing waiting time and traffic congestion). The Deep SARSA (DSARSA) learning algorithm with experience replay applied in the training process. The DSARSA replay model is built to extract traffic information from all lanes by applying multilayer perceptron with two hidden layers. Then, at the output layer an approximated qvalue is produced and updated with SARSA algorithm. For training, a random batch of experiences is sampled from the replay buffer. This helps break the correlation between consecutive experiences, stabilizes training and enables the model to be to able deal appropriately with any traffic condition. Experiments on 1000 episode of low, medium and heavy traffic flow was conducted on SUMO traffic simulator. TraCL allows interface with the SUMO simulation using python showed that the proposed DSARSA replay model achieved a state-of-the-art performance compared to other baseline DRL models applied in traffic control in terms of learning stability and reward maximization.
Read full abstract