Optimization of green signal timing for each phase at a signalized intersection in an urban area is critical for efficacious traffic management and congestion mitigation. Many algorithms are developed, yet very few are for cities in developing nations where traffic is characterized by its heterogeneous nature. While some of the recent studies have explored different variants of Max Pressure (MP) and Reinforcement Learning (RL) for optimizing phase timing, the focus is limited to homogeneous traffic conditions. In developing nations, such as India, control systems, like fixed and actuated, are still predominantly used in practice. Composite Signal Control Strategy (CoSiCoSt) is also employed at a few intersections. However, there is a notable absence of advanced models addressing heterogeneous traffic behavior, which have a great potential to reduce delays and queue lengths. The present study proposes a hybrid algorithm for an adaptive traffic control system for real-world heterogeneous traffic conditions. The proposed algorithm integrates Max Pressure with Reinforcement Learning. The former dynamically determines phase order by performing pressure calculations for each phase. The latter optimizes the phase timings for each phase to minimize delays and queue lengths using the proximal policy optimization algorithm. In contrast to the past RL models, in which the phase timing is determined for all phases at once, in the proposed algorithm, the phase timings are determined after the execution of every phase. To assess the impact, classified traffic volume is extracted from surveillance videos of an intersection in Ludhiana, Punjab and simulated using Simulation of Urban Mobility (SUMO). Traffic volume data is collected over three distinct time periods of the day. The results of the proposed algorithm are compared with benchmark algorithms, such as Actuated, CoSiCoSt, and acyclic & cyclic Max Pressure, Reinforcement Learning-based algorithms. To assess the performance, queue length, delay, and queue dissipation time are considered as key performance indicators. Of actuated and CoSiCoSt, the latter performs better, and thus, the performance of the proposed hybrid algorithm is compared with CoSiCoSt. The proposed algorithm effectively reduces total delay and queue dissipation time in the range of 77.07%–87.66% and 53.95%–62.07%, respectively. Similarly, with respect to the best-performing RL model, the drop in delay and queue dissipation time range from 55.63 to 77.12% and 22.13 to 43.7%, respectively, which is significant at 99% confidence level. The proposed algorithm is deployed on a wireless hardware architecture to confirm the feasibility of real-world implementation. The findings highlight the algorithm’s potential as an efficient solution for queues and delays at signalized intersections, where mixed traffic conditions prevail.