Military Image Captioning for Low-Altitude UAV or UGV Perspectives

Lizhi Pan,Chengtian Song,Keyu Xu,Yue Xie,Xiaozheng Gan

doi:10.3390/drones8090421

Abstract

Low-altitude unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), which boast high-resolution imaging and agile maneuvering capabilities, are widely utilized in military scenarios and generate a vast amount of image data that can be leveraged for textual intelligence generation to support military decision making. Military image captioning (MilitIC), as a visual-language learning task, provides innovative solutions for military image understanding and intelligence generation. However, the scarcity of military image datasets hinders the advancement of MilitIC methods, especially those based on deep learning. To overcome this limitation, we introduce an open-access benchmark dataset, which was termed the Military Objects in Real Combat (MOCO) dataset. It features real combat images captured from the perspective of low-altitude UAVs or UGVs, along with a comprehensive set of captions. Furthermore, we propose a novel encoder–augmentation–decoder image-captioning architecture with a map augmentation embedding (MAE) mechanism, MAE-MilitIC, which leverages both image and text modalities as a guiding prefix for caption generation and bridges the semantic gap between visual and textual data. The MAE mechanism maps both image and text embeddings onto a semantic subspace constructed by relevant military prompts, and augments the military semantics of the image embeddings with attribute-explicit text embeddings. Finally, we demonstrate through extensive experiments that MAE-MilitIC surpasses existing models in performance on two challenging datasets, which provides strong support for intelligence warfare based on military UAVs and UGVs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Military Image Captioning for Low-Altitude UAV or UGV Perspectives

Abstract

Talk to us

Similar Papers

More From: Drones

Lead the way for us

Journal: Drones	Publication Date: Aug 24, 2024
License type: CC BY 4.0

Similar Papers

Towards collaboration between unmanned aerial and ground vehicles for precision agriculture
Subodh Bhandari ... Dat Do
-
Subodh Bhandari, et. al.Subodh Bhandari ... Dat Do
08 May 2017
08 May 2017

Coordination Between Unmanned Aerial and Ground Vehicles: A Taxonomy and Optimization Perspective.
Jie Chen ... Xing Zhang
IEEE Transactions on Cybernetics | VOL. 46
Jie Chen, et. al.Jie Chen ... Xing Zhang
17 Apr 2015
IEEE Transactions on Cybernetics | VOL. 46

Theory and experiment on distributed output formation tracking of unmanned aerial and ground vehicle swarm systems over jointly connected digraphs
Peixuan Shu ... Zhang Ren
Control Engineering Practice | VOL. 152
Peixuan Shu, et. al.Peixuan Shu ... Zhang Ren
29 Aug 2024
Control Engineering Practice | VOL. 152

World representations for unmanned vehicles
Gregory S Broten ... Jack Collier
-
Gregory S Broten, et. al.Gregory S Broten ... Jack Collier
27 Apr 2007
27 Apr 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Military Image Captioning for Low-Altitude UAV or UGV Perspectives

Abstract

Talk to us

Similar Papers

More From: Drones