Abstract

Section III-F in [1] made formal claims about the limiting behavior of the controller (as time and space grow to infinity) whose proof relied on a stochastic network optimization theorem which may not apply. Specifically, we cannot make the claim that the optimal value of $\overline{Y}$ achieved by a one-step local controller and the constraints on time-averages can both be achieved arbitrarily closely by the controller given by (16). This is because to prove this claim and demonstrate that this controller stabilizes all queues as $t \rightarrow \infty$, the theorem (see drift-plus-penalty method [2]) requires that at each time slot there exist a decision that causes the size of the virtual queues to decrease, or remain constant, in expectation. This requirement is not satisfied as the robot may reach a state where all possible actions increase the queues sizes. Note however that the claim that our controller will achieve a bounded queuing delay still holds in the more realistic context of finite time and/or space, and the performance gains of the proposed controller remain valid in practice.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call