Transformer for multiple object tracking: Exploring locality to vision

Shan Wu, Amnir Hadachi, Chaoru Lu, Damien Vivet

doi:10.1016/j.patrec.2023.04.016

Abstract

Multi-object tracking (MOT) is a critical task in domains such as traffic analysis, surveillance, and autonomous vehicles. The joint-detection-and-tracking paradigm has been extensively researched: it is faster and more convenient to train and deploy than the classic tracking-by-detection paradigm while achieving state-of-the-art performance. This paper explores ways of enhancing an MOT system by leveraging the prevailing convolutional neural network (CNN) together with a novel vision transformer technique, locality. Transformers adopted for computer vision tasks have several deficiencies. While they are good at modeling global information over a long embedding, they lack a locality mechanism for learning local features. This can lead to small objects being neglected, which may cause security issues. We combine the TransTrack MOT system with a locality mechanism inspired by LocalViT and find that the locality-enhanced system outperforms the baseline TransTrack by 5.3% MOTA on the MOT17 dataset.
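As a rough illustration of what a locality mechanism can look like, the sketch below reshapes a flat token sequence back into its 2D grid and applies a depthwise 3x3 convolution, so that each token is mixed with its spatial neighbours. It assumes the LocalViT-style idea of placing a depthwise convolution in the feed-forward path. It is not the authors' implementation, and all function names (tokens_to_grid, depthwise_conv3x3, locality_mix) are hypothetical. Pure Python is used only to keep the example self-contained.

```python
# Illustrative sketch only: a depthwise 3x3 convolution over a token grid.
# Function names are hypothetical and do not come from TransTrack or LocalViT code.

def tokens_to_grid(tokens, height, width):
    """Reshape a flat list of H*W token vectors into an H x W grid."""
    if len(tokens) != height * width:
        raise ValueError("token count must equal height * width")
    return [tokens[r * width:(r + 1) * width] for r in range(height)]


def grid_to_tokens(grid):
    """Flatten an H x W grid of token vectors back into a sequence."""
    return [vec for row in grid for vec in row]


def depthwise_conv3x3(grid, kernels):
    """Apply one 3x3 kernel per channel, with zero padding at the borders.

    grid:    H x W x C nested lists of floats
    kernels: C kernels, each a 3 x 3 nested list
    """
    height, width, channels = len(grid), len(grid[0]), len(grid[0][0])
    out = [[[0.0] * channels for _ in range(width)] for _ in range(height)]
    for y in range(height):
        for x in range(width):
            for c in range(channels):
                acc = 0.0
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        yy, xx = y + dy, x + dx
                        # Positions outside the grid contribute zero.
                        if 0 <= yy < height and 0 <= xx < width:
                            acc += kernels[c][dy + 1][dx + 1] * grid[yy][xx][c]
                out[y][x][c] = acc
    return out


def locality_mix(tokens, height, width, kernels):
    """Sequence -> grid -> depthwise conv -> sequence."""
    grid = tokens_to_grid(tokens, height, width)
    return grid_to_tokens(depthwise_conv3x3(grid, kernels))


# Usage: nine single-channel tokens laid out on a 3 x 3 grid.
tokens = [[float(i)] for i in range(9)]
identity_kernel = [[[0, 0, 0], [0, 1, 0], [0, 0, 0]]]  # leaves tokens unchanged
box_kernel = [[[1, 1, 1], [1, 1, 1], [1, 1, 1]]]       # sums each 3x3 neighbourhood
unchanged = locality_mix(tokens, 3, 3, identity_kernel)
mixed = locality_mix(tokens, 3, 3, box_kernel)
```

With the box kernel, each output token depends only on its immediate neighbours in the image plane, which is the kind of local inductive bias that plain self-attention over a flattened sequence does not impose. In a trained model the kernels would be learned, and the convolution would sit between pointwise expansion and projection layers.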

Journal: Pattern Recognition Letters
Publication Date: Apr 30, 2023
Citations: 5

Similar Papers

Real-Time Bird’s Eye View Multi-Object Tracking system based on Fast Encoders for Object Detection
Carlos Gomez-Huelamo ... Rafael Barea
20 Sep 2020

UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking
Longyin Wen ... Siwei Lyu
Computer Vision and Image Understanding | VOL. 193
27 Jan 2020

Real-Time Multiobject Tracking Based on Multiway Concurrency.
Xuan Gong ... Hui Wang
Sensors | VOL. 21
20 Jan 2021

Analysis Based on Recent Deep Learning Approaches Applied in Real-Time Multi-Object Tracking: A Review
Lesole Kalake ... Wanggen Wan
IEEE Access | VOL. 9
01 Jan 2020
