Abstract

This paper proposes a reinforcement learning (RL) based Q-learning to address the issues of joint spectrum prediction (SP) and device-to-device (D2D) data communication in cognitive radio (CR) framework. An optimization problem is formulated that addresses energy efficiency (EE) maximization of D2D communications under the constraints of its total transmit power and a certain data transmission rate while meeting an interference threshold and cooperation rate in primary user (PU) transmission. The high accuracy in SP offers reward as an improvement on EE while a compulsion of meeting an interference threshold and a penalty on PU data transmission are made based on the relative degree of wrong prediction. A large set of simulation results shows that the proposed method offers 30% gain in EE while 20% reduction in data collision with PU over the existing works.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.