Abstract

This paper deals with the autopilot design methodology for UCAVs (Unmanned Combat Aerial Vehicles) using the learning-based approach. In this study, the acceleration control is considered for the UCAV’s autopilot design, and the reinforcement learning, especially deep deterministic policy gradient (DDPG) method, is used to design the autopilot. To this end, we establish the autopilot design problem based on the reinforcement learning framework. First, we define an actor, an environment, an observation, and a reward. And then, we design appropriate observation and reward function for the autopilot design problems using the reinforcement learning framework. In order to validate the proposed method, the numerical simulation is performed using tensor flow. The potential importance of the proposed approach is that it can learn the autopilot algorithm from scratch without any model information of the UCAV systems. Therefore, the proposed method can allow to significantly reduce time and effort for identifying the model information in the autopilot design process. This ability of the proposed method could also be useful for improving the safety of the systems against the unexpected system failure or damage of the UCAVs during their operations.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.