Abstract

We consider a many-server queue in which each server can serve multiple customers in parallel. Such multitasking phenomena occur in various applications areas (e.g., in hospitals and contact centers), although the impact of the number of customers who are simultaneously served on system efficiency may vary. We establish diffusion limits of the queueing process under the quality-and-efficiency-driven scaling and for different policies of assigning customers to servers depending on the number of customers they serve. We show that for a broad class of routing policies, including routing to the least busy server, the same one-dimensional diffusion process is obtained in the heavy-traffic limit. In case of assignment to the most busy server, there is no state-space collapse, and the diffusion limit involves a custom regulator mapping. Moreover, we also show that assigning customers to the least (most) busy server is optimal when the cumulative service rate per server is concave (convex), motivating the routing policies considered. Finally, we also derive diffusion limits in the nonheavy-traffic scaling regime and in the heavy-traffic scaling regime where customers can be reassigned during service. Funding: The research of J. Storm is partly funded by the Netherlands Organization for Scientific Research (NWO) Gravitation project Networks [Grant 024.002.003].

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.