Abstract

This paper investigates the energy efficiency of device-to-device (D2D) communications in heterogeneous networks. To minimize the total transmit power, an approach based on Q-learning with an adaptive ε-greedy policy is proposed to optimize the association of each user equipment (UE) with a base station (BS) or an access point (AP). The proposed adaptive ε-greedy policy performs adequate exploration and exploitation for effective optimization. Simulation results indicate that, in the single-cell scenario, the proposed adaptive ε-greedy approach achieves performance close to the best solution.
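
The abstract does not spell out the learning rule, so the following is only a minimal sketch of how Q-learning with an adaptive ε-greedy policy could drive UE association. Everything specific here is an assumption rather than the paper's method: the `POWER` table of per-link transmit powers, the stateless one-step Q update (no discounting), independent learners per UE with no interference coupling, and in particular the `adapt_epsilon` rule, which raises ε while temporal-difference errors are large and lowers it once they settle.

```python
import random

# Hypothetical transmit power (watts) each UE would need per candidate node.
# Rows are UEs; columns are [BS, AP]. Illustrative values, not from the paper.
POWER = [
    [0.8, 0.2],
    [0.3, 0.9],
    [0.5, 0.4],
]


def adapt_epsilon(eps, td_error, tol=0.05, grow=1.05, shrink=0.95,
                  eps_min=0.01, eps_max=1.0):
    """Assumed adaptation rule: explore more while TD errors are large,
    explore less once the Q-values have settled."""
    eps = eps * grow if abs(td_error) > tol else eps * shrink
    return min(eps_max, max(eps_min, eps))


def train(power, steps=500, alpha=0.1, eps0=0.5, seed=0):
    """Each UE independently learns Q-values for its candidate nodes.

    Simplification: one-step (stateless) Q-learning, so the update target
    is just the immediate reward, taken as the negative transmit power.
    """
    rng = random.Random(seed)
    n_ue, n_nodes = len(power), len(power[0])
    q = [[0.0] * n_nodes for _ in range(n_ue)]
    eps = [eps0] * n_ue
    for _ in range(steps):
        for ue in range(n_ue):
            if rng.random() < eps[ue]:
                a = rng.randrange(n_nodes)  # explore: random node
            else:
                a = max(range(n_nodes), key=lambda k: q[ue][k])  # exploit
            reward = -power[ue][a]
            td_error = reward - q[ue][a]
            q[ue][a] += alpha * td_error
            eps[ue] = adapt_epsilon(eps[ue], td_error)
    return q, eps


def greedy_association(q):
    """Pick, for each UE, the node index with the highest learned Q-value."""
    return [max(range(len(row)), key=lambda k: row[k]) for row in q]


if __name__ == "__main__":
    q_values, final_eps = train(POWER)
    print(greedy_association(q_values), final_eps)
```

In this toy setting the learned association can be checked against an exhaustive search over the small `POWER` table, which mirrors the abstract's comparison with the best solution. A faithful reproduction would need the paper's actual state, reward, and ε-adaptation definitions.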
