Abstract

Autonomous Intersection Management (AIM) for high-level Connected and Automated Vehicles (CAVs) has evolved from rule-based to optimisation-based policies. However, at congested major-minor intersections, optimising solely for efficiency can negatively impact vehicle fairness. This study addresses this issue by proposing a deep reinforcement learning approach that optimises both traffic efficiency and fairness for AIM. In the modelled multi-objective Markov decision process, traffic fairness is measured by the difference between the crossing order and the approaching order of CAVs, while traffic efficiency is measured by average travel time. With unknown preferences of the objectives, Bellman optimality equation is generalised to obtain the optimal policies over the space of all possible preferences during the iterative training process. The effectiveness of the proposed method is evaluated in a simulated real-world intersection and compared with three benchmark policies, including the fairest policy for AIM: first-come-first-served. The learned policies perform best in reducing overall average vehicle delay, and demonstrate outstanding performance in balancing traffic fairness and efficiency.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call