Research on automatic pilot repetition generation method based on deep reinforcement learning.

Weijun Pan, Junxiang Huang, Peiyuan Jiang, Yukun Li, Zhuang Wang

doi:10.3389/fnbot.2023.1285831

Abstract

Using computers to replace pilot seats in air traffic control (ATC) simulators is an effective way to improve controller training efficiency and reduce training costs. To this end, we propose RoBERTa-RL (RoBERTa with Reinforcement Learning), a deep reinforcement learning model for generating pilot repetitions. RoBERTa-RL builds on the pre-trained language model RoBERTa and is optimized through transfer learning and reinforcement learning. Transfer learning addresses the scarcity of data in the ATC domain, while reinforcement learning is used to optimize the RoBERTa model and overcome the limits on generalization that transfer learning introduces. We used a real-world area control dataset for training and testing on the target task, and a tower control dataset, generated according to civil aviation radio land-air communication rules, as a test set for evaluating generalization. On the area control dataset, RoBERTa-RL achieved ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.9962, 0.992, and 0.996, respectively; on the tower control dataset, the scores were 0.982, 0.954, and 0.982. Because ROUGE has limitations in this field, we also evaluated the model with keyword-based criteria for the generated repetition instructions, which compute several keyword-based metrics from the segmented text of each repetition instruction. Under these criteria, the model achieved an overall accuracy of 98.8% on the area control dataset and 81.8% on the tower control dataset. In terms of generalization, RoBERTa-RL improved accuracy by 56% over the model before improvement and by 47.5% over the comparative models. These results indicate that using reinforcement learning strategies to enhance deep learning algorithms can effectively mitigate poor generalization in text generation tasks, and the approach holds promise for application in other related domains.
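
As an illustration of the ROUGE metrics reported in the abstract, the following is a minimal sketch of ROUGE-1, ROUGE-2, and ROUGE-L computed as F1 scores over whitespace-separated tokens. The abstract does not state which ROUGE variant, tokenizer, or implementation was used, so the F1 formulation, the function names, and the example instruction pair are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, cand_total, ref_total):
    """Harmonic mean of precision and recall; 0.0 when nothing overlaps."""
    if overlap == 0 or cand_total == 0 or ref_total == 0:
        return 0.0
    precision = overlap / cand_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(candidate, reference, n):
    """ROUGE-N F1: clipped n-gram overlap between candidate and reference."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    overlap = sum((cand & ref).values())  # multiset intersection clips counts
    return f1(overlap, sum(cand.values()), sum(ref.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    """ROUGE-L F1 based on the longest common subsequence."""
    c, r = candidate.split(), reference.split()
    return f1(lcs_length(c, r), len(c), len(r))


# Hypothetical repetition pair, not taken from the paper's datasets.
reference = "climb to flight level three two zero air china four one two"
candidate = "climbing to flight level three two zero air china four one two"

for name, score in [
    ("ROUGE-1", rouge_n(candidate, reference, 1)),
    ("ROUGE-2", rouge_n(candidate, reference, 2)),
    ("ROUGE-L", rouge_l(candidate, reference)),
]:
    print(f"{name}: {score:.4f}")
```

In this hypothetical pair a single wrong word ("climbing" for "climb") still leaves every score above 0.9, which suggests why an n-gram overlap metric alone may not reveal whether a safety-critical keyword was repeated correctly, and why a keyword-based evaluation is a useful complement.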

Journal: Frontiers in Neurorobotics
Publication Date: Oct 11, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

A Survey of Multi-Task Deep Reinforcement Learning
Nelson Vithayathil Varghese ... Qusay H Mahmoud
Electronics | VOL. 9
22 Aug 2020

Sample efficient deep reinforcement learning for control
15 Dec 2019

Exploration Exploitation Problem in Policy Based Deep Reinforcement Learning for Episodic and Continuous Environments
Vedang Naik ... Sameer Mahajan
International Journal of Engineering and Advanced Technology | VOL. 11
30 Dec 2021

How are reinforcement learning and deep learning algorithms used for big data based decision making in financial industries–A review and research agenda
Vinay Singh ... Agam Gupta
International Journal of Information Management Data Insights | VOL. 2
28 Jun 2022
