SODFormer: Streaming Object Detection With Transformer Using Events and Frames.

Dianze Li,Yonghong Tian,Jianing Li

doi:10.1109/tpami.2023.3298925

Abstract

DAVIS camera, streaming two complementary sensing modalities of asynchronous events and frames, has gradually been used to address major object detection challenges (e.g., fast motion blur and low-light). However, how to effectively leverage rich temporal cues and fuse two heterogeneous visual streams remains a challenging endeavor. To address this challenge, we propose a novel streaming object detector with Transformer, namely SODFormer, which first integrates events and frames to continuously detect objects in an asynchronous manner. Technically, we first build a large-scale multimodal neuromorphic object detection dataset (i.e., PKU-DAVIS-SOD) over 1080.1 k manual labels. Then, we design a spatiotemporal Transformer architecture to detect objects via an end-to-end sequence prediction problem, where the novel temporal Transformer module leverages rich temporal cues from two visual streams to improve the detection performance. Finally, an asynchronous attention-based fusion module is proposed to integrate two heterogeneous sensing modalities and take complementary advantages from each end, which can be queried at any time to locate objects and break through the limited output frequency from synchronized frame-based fusion strategies. The results show that the proposed SODFormer outperforms four state-of-the-art methods and our eight baselines by a significant margin. We also show that our unifying framework works well even in cases where the conventional frame-based camera fails, e.g., high-speed motion and low-light conditions. Our dataset and code can be available at https://github.com/dianzl/SODFormer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SODFormer: Streaming Object Detection With Transformer Using Events and Frames.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Nov 1, 2023
Citations: 5

Similar Papers

Retinomorphic Object Detection in Asynchronous Visual Streams
Jianing Li ... Lin Zhu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Jianing Li, et. al.Jianing Li ... Lin Zhu
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

A Large-scale Detection Algorithm and Application Based on YOLOv4
Xiangbin Shi ... Jinwen Peng
-
Xiangbin Shi, et. al.Xiangbin Shi ... Jinwen Peng
01 Oct 2021
01 Oct 2021

Sensor-Fused Low Light Pedestrian Detection System with Transfer Learning
Bharath Kumar Thota ... Jungme Park
-
Bharath Kumar Thota, et. al.Bharath Kumar Thota ... Jungme Park
09 Apr 2024
09 Apr 2024

Towards Large-Scale Small Object Detection: Survey and Benchmarks.
Gong Cheng ... Xiwen Yao
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Gong Cheng, et. al.Gong Cheng ... Xiwen Yao
01 Jan 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SODFormer: Streaming Object Detection With Transformer Using Events and Frames.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence