Abstract

Unmanned aerial vehicles (UAVs) with the potential of providing reliable high-rate connectivity, are becoming a promising component of future wireless networks. A UAV collects data from a set of randomly distributed sensors, where both the locations of these sensors and their data volume to be transmitted are unknown to the UAV. In order to assist the UAV in finding the optimal motion trajectory in the face of the uncertainty without the above knowledge whilst aiming for maximizing the cumulative collected data, we formulate a reinforcement learning problem by modelling the motion-trajectory as a Markov decision process with the UAV acting as the learning agent. Then, we propose a pair of novel trajectory optimization algorithms based on stochastic modelling and reinforcement learning, which allows the UAV to optimize its flight trajectory without the need for system identification. More specifically, by dividing the considered region into small tiles, we conceive state-action-reward-state-action (Sarsa) and $Q$-learning based UAV-trajectory optimization algorithms (i.e., SUTOA and QUTOA) aiming to maximize the cumulative data collected during the finite flight-time. Our simulation results demonstrate that both of the proposed approaches are capable of finding an optimal trajectory under the flight-time constraint. The preference for QUTOA vs. SUTOA depends on the relative position of the start and the end points of the UAVs.

Highlights

  • In the emerging Internet of Everything (IoE), future networks are expected to autonomously determine the connection of people, processes and things

  • Since a virtual reward will be received by the unmanned aerial vehicles (UAVs) when it arrives at the destination, the sum of rewards is higher than the true amount of the collected data

  • In this article, we invoked the reinforcement learning for the UAV’s trajectory optimization with the goal of maximizing the cumulative data volume fetched from the sensors, where the network information are unavailable to the UAV, such as the locations of the sensors and the amount of data to be transmitted

Read more

Summary

Introduction

In the emerging Internet of Everything (IoE), future networks are expected to autonomously determine the connection of people, processes and things. The exciting applications of unmanned aerial vehicles (UAVs) or drones have drawn considerable attention from academia, industry and regulatory bodies, for expanding the attainable communication coverage and offering on-demand connectivity [1]. Smart UAV-assisted solutions are capable of enhancing nextgeneration wireless networks. A range of professional and civil applications of UAVs have been envisioned, including parcel delivery, communications and media, inspection of critical infrastructure, communication relaying, search-andrescue operations, and surveillance, among others [2], [3]. Some off-the-shelf UAVs have a recharge-duration of less than 20 minutes, and a flight range of about 15 miles [4].

Objectives
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.