A complementary dual-backbone transformer extracting and fusing weak cues for object detection in extremely dark videos

Bo Zhang,Jinli Suo,Qionghai Dai

doi:10.1016/j.inffus.2023.101822

Abstract

Reliable object detection under dark environment is of wide applications but severely challenged by heavy noise washing out informative features and uneven radiance caused by nighttime illuminations. These unique features of dark videos would largely degenerate the performance of existing detectors. To address this issue, specially designed algorithms being able to extract and fuse the weak features buried in the low-quality videos are of vital importance. Bearing these in mind, we propose illumination-aware spatio-temporal feature fusion modules for low-light video object detection and implement a Dark Video Detector under a TRansformer network structure, dubbed as DVD-TR. Firstly, we use a dual-backbone Transformer to extract separate complementary features and fuse them to strengthen the network’s feature extraction capability. Secondly, we incorporate a spatio-temporal sampling mechanism to aggregate features from multiple frames, which can enhance detection accuracy in dark videos. Thirdly, we use a small encoder–decoder network to obtain irradiance distribution which is further incorporated for illumination-aware feature fusion. Extensive experiments on large-scale multi-illuminance dark video benchmark show that DVD-TR outperforms state-of-the-art video detectors by a large margin and validate the effectiveness of the proposed approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A complementary dual-backbone transformer extracting and fusing weak cues for object detection in extremely dark videos

Abstract

Talk to us

Similar Papers

More From: Information Fusion

Lead the way for us

Journal: Information Fusion	Publication Date: Apr 28, 2023
Citations: 4

Similar Papers

3D Object Detection Using Multiple-Frame Proposal Features Fusion.
Minyuan Huang ... Henry Leung
Sensors (Basel, Switzerland) | VOL. 23
Minyuan Huang, et. al.Minyuan Huang ... Henry Leung
14 Nov 2023
Sensors (Basel, Switzerland) | VOL. 23

MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
Liming Zhou ... Yadi Wang
Drones | VOL. 8
Liming Zhou, et. al.Liming Zhou ... Yadi Wang
08 May 2024
Drones | VOL. 8

Steel Strip Surface Defect Detection Method Based on Improved YOLOv5s.
Jianbo Lu ... Xiaoya Ma
Biomimetics (Basel, Switzerland) | VOL. 9
Jianbo Lu, et. al.Jianbo Lu ... Xiaoya Ma
03 Jan 2024
Biomimetics (Basel, Switzerland) | VOL. 9

Data safety prediction using YOLOv7+G3HN for traffic roads
Lek Ming Lim ... Majid Khan Majahar Ali
Journal of the Nigerian Society of Physical Sciences | VOL. -
Lek Ming Lim, et. al.Lek Ming Lim ... Majid Khan Majahar Ali
10 Aug 2024
Journal of the Nigerian Society of Physical Sciences | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A complementary dual-backbone transformer extracting and fusing weak cues for object detection in extremely dark videos

Abstract

Talk to us

Similar Papers

More From: Information Fusion