Abstract
A parallel server system is considered, with I customer classes and many servers, operating in a heavy traffic diffusion regime where the queueing delay and service time are of the same order of magnitude. Denoting by $\widehat{X}^{n}$ and $\widehat{Q}^{n}$ , respectively, the diffusion scale deviation of the headcount process from the quantity corresponding to the underlying fluid model and the diffusion scale queue-length, we consider minimizing r.v.'s of the form $c^{n}_{X}=\int_{0}^{u}C(\widehat{X}^{n}(t))\,dt$ and $c^{n}_{Q}=\int_{0}^{u}C(\widehat{Q}^{n}(t))\,dt$ over policies that allow for service interruption. Here, C:? I ??+ is continuous, and u>0. Denoting by ? the so-called workload vector, it is assumed that $C^{*}(w):=\min\{C(q):q\in\mathbb{R}_{+}^{\mathbf{I}},\theta\cdot q=w\}$ is attained along a continuous curve as w varies in ?+. We show that any weak limit point of $c^{n}_{X}$ stochastically dominates the r.v. $\int_{0}^{u}C^{*}(W(t))\,dt$ for a suitable reflected Brownian motion W and construct a sequence of policies that asymptotically achieve this lower bound. For $c^{n}_{Q}$ , an analogous result is proved when, in addition, C ? is convex. The construction of the policies takes full advantage of the fact that in this regime the number of servers is of the same order as the typical queue-length.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.