Datacenter applications desire low latency for short messages to provide a better user experience. Therefore, one of the goals of datacenter networks is to minimize flow completion time (FCT), especially for short flows. Multiple scheduling disciplines have been proposed to achieve this goal. In this paper, we develop a Dynamic Longer Stay Less Priority (D-LSLP) which looks at the current queue occupancy to adjust the demotion threshold for the packets of the flows arriving in the strict priority queues. Initially, D-LSLP considers every flow to be a short flow, and with the passage of time, the flow is demoted to the lower priority queue, similar to Multilevel Feedback Queue (MLFQ) Scheduling. It enables short flows to be completed in a couple of higher-priority queues. In contrast, large flows are demoted to the lower priority queues after remaining active in the queues for a certain amount of time (demotion threshold). However, for the different traffic patterns with different distribution of flow lengths, the demotion threshold for the flows should be adapted automatically. In this paper, we leverage the programmable nature of the P4 switches to measure the instant queue occupancy in the P4, which can be used to adjust the demotion threshold accordingly for the highest priority queue. This enables D-LSLP to increase or decrease its highest priority demotion threshold based on the instant queue status. D-LSLP allows multiple traffic patterns to coexist without manually tuning the demotion thresholds. Furthermore, it reduces the tail drop in the highest priority queue when many short flows overwhelm the highest priority queue. The performance evaluation shows that it works equally well for different traffic patterns without operator intervention.
Read full abstract