Abstract

This work develops a path reinforcement learning planning algorithm for unmanned aerial vehicles (UAVs) based on the probability distribution of dynamic targets. The proposed algorithm enables the effective interception of aerial dynamic invasive targets into nuclear power plants. In accordance with the status information of the invasive target, the target probability distribution is calculated by the probability diffusion principle and the target's possible location is deduced based on the obtained distribution. On this basis, the action space based on path point transfer rules is established and the dynamic updating mechanism of the reward function based on the target probability distribution is designed. Using $Q$-learning for continuous path optimization, a UAV path planning framework based on the target probability distribution and reinforcement learning is constructed, so as to realize the UAV path planning. Simulation results indicate that the proposed method can enable the autonomous path planning of UAVs for intercepting aerial dynamic invasive targets into nuclear power plants when the target point changes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.