AV-FDTI: Audio-visual fusion for drone threat identification

Yizhuo Yang, Shenghai Yuan, Jianfei Yang, Thien Hoang Nguyen, Muqing Cao, Thien-Minh Nguyen, Han Wang, Lihua Xie

doi:10.1016/j.jai.2024.06.002

Abstract

In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which can transport harmful payloads or cause significant damage, we present AV-FDTI, an Audio-Visual Fusion system for Drone Threat Identification. AV-FDTI fuses audio features with omnidirectional camera features to improve the precision and resilience of drone classification and 3D localization. Specifically, AV-FDTI employs a convolutional recurrent neural network (CRNN) to capture temporal dynamics in the audio domain and a pretrained ResNet50 model for image feature extraction. Furthermore, we adopt a mechanism based on visual information entropy and cross-attention to enhance the fusion of visual and audio data. Notably, our system is trained on automated Leica tracking annotations, which provide ground truth with millimeter-level accuracy. Comprehensive comparative evaluations demonstrate the superiority of our solution over existing systems. To advance this field, we will release this work as open-source code together with a wearable AV-FDTI design, contributing valuable resources to the research community.
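The abstract names visual information entropy and cross-attention as the fusion ingredients but does not say how either is computed. As a rough illustration only, the sketch below assumes that entropy means the Shannon entropy of a frame's gray-level histogram and that cross-attention means standard scaled dot-product attention, with audio features as queries and visual features as keys and values. The function names, the `entropy_gate` normalization, and the toy data are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy in bits of a flat sequence of integer gray levels."""
    total = len(pixels)
    counts = Counter(pixels)
    return sum(-(c / total) * math.log2(c / total) for c in counts.values())


def entropy_gate(pixels, levels=256):
    """Hypothetical gate in [0, 1]: entropy divided by its maximum, log2(levels)."""
    return shannon_entropy(pixels) / math.log2(levels)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    norm = sum(exps)
    return [e / norm for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention; each query row attends over all key rows."""
    scale = math.sqrt(len(keys[0]))
    fused = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        fused.append([sum(w * v[i] for w, v in zip(weights, values))
                      for i in range(len(values[0]))])
    return fused


# Toy inputs (assumed, for illustration): two 16-pixel frames and tiny feature rows.
dark_frame = [0] * 16              # a single gray level, so no information
textured_frame = list(range(16))   # 16 distinct gray levels
audio_tokens = [[1.0, 0.0]]                 # stand-in for CRNN audio features
visual_tokens = [[1.0, 0.0], [0.0, 1.0]]    # stand-in for ResNet50 image features

# Scale visual features by the entropy gate, then let audio attend over them.
gate = entropy_gate(textured_frame, levels=16)
gated_visual = [[gate * x for x in row] for row in visual_tokens]
fused = cross_attention(audio_tokens, gated_visual, gated_visual)
```

Under this assumed design, a low-entropy frame (for example, a dark or featureless view) shrinks the visual features toward zero, so the fused output leans less on the camera when it carries little information. The paper's actual mechanism may differ.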

Journal: Journal of Automation and Intelligence
Publication Date: Jun 25, 2024
License type: CC BY-NC-ND
