Abstract
Device to device (D2D) communication is a potential technology in the next generation communication system. We study the mode selection problem in a scenario where the D2D communication has a deadline constraint while the cellular user requires a minimum rate. We formulate the mode selection problem as a Constrained Markov Decision Process (CMDP). By adopting the Lagrangian multiplier technique, we first transform the CMDP into a non-constrained MDP problem. Then, we propose an optimal stationary mode selection algorithm combining the value iteration algorithm and the sub-gradient updating of the Lagrangian multiplier. Further, we propose an optimal randomized mode selection algorithm. Moreover, we contribute on deriving the closed-form policy selection probability. Simulation results have verified the optimality of the proposed algorithm.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.