Abstract
We propose an adaptive reinforcement learning (A-RL) framework to maximize the sum-rate for non-orthogonal multiple access-unmanned aerial vehicle (NOMA-UAV) network. In this framework, Mamdani fuzzy inference system (MFIS) supervises a reinforcement learning (RL) policy based on multi-armed bandits (MAB). UAV as learning agent serves an internet of things (IoT) region. It manages an interference affected, channel block for NOMA uplink. Sum-rate, rate outage probability and average bit error rate (BER) for far-user are compared. Simulations reveal superior performance of A-RL, compared to non-adaptive RL counterpart. Joint maximum likelihood detection (JMLD) and successive interference cancellation (SIC) are also compared for BER performance and implementation complexity.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.