Abstract
Fully transformer-based one-stream trackers have demonstrated outstanding performance on challenging benchmark datasets over the past three years. These trackers enable bidirectional information flow between all target template and search region tokens when computing attention features, without assessing the impact of this unrestricted flow on the tracker's discriminative capability. Our study found that information flow from the large portion of background tokens in the search region dilutes the target-specific features of the template. Moreover, previous transformer-based trackers fail to consider cues from the dynamic background region, even though it contains information about similar, distracting objects. To address these limitations, we propose a novel Selective Information Flow Tracking (SIFTrack) framework that enhances the tracker's discriminative capability by selectively allowing information flow between different groups of tokens. In the early encoder layers of SIFTrack, interactions from all search tokens to target template tokens are blocked to enrich target-specific feature extraction. In the deeper encoder layers, search tokens are partitioned into target and non-target tokens based on their attention scores, and bidirectional flow between target search tokens and template tokens is then allowed to capture appearance changes of the target. In addition, by including tokens from the dynamic background, SIFTrack captures cues from the area surrounding the target and thereby effectively avoids distractor objects. The proposed SIFTrack demonstrated outstanding performance on challenging benchmarks, particularly excelling on the one-shot tracking benchmark GOT-10k, where it achieves an average overlap of 74.6%. The code, models, and results of this work are available at https://github.com/JananiKugaa/SIFTrack.
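As a rough illustration of the selective-flow idea described above (a minimal sketch, not the authors' implementation), the snippet below builds an additive attention mask in PyTorch that blocks search-to-template information flow in early layers and, in deeper layers, lets only the selected target search tokens update the template. The token counts, the top-k selection of target tokens, and the function name `selective_flow_mask` are illustrative assumptions.

```python
import torch

def selective_flow_mask(num_template, num_search, target_idx=None):
    """Additive attention mask (0 = allow, -inf = block) over [template | search] tokens.

    Early layers (target_idx is None): template queries may not attend to any
    search keys, so no information flows from the search region into the template.
    Deeper layers (target_idx given): template queries may attend only to the
    search tokens judged to belong to the target.
    """
    n = num_template + num_search
    mask = torch.zeros(n, n)
    # Block all search -> template information flow by default.
    mask[:num_template, num_template:] = float("-inf")
    if target_idx is not None:
        # Re-open flow from the selected target search tokens to the template.
        mask[:num_template, num_template + target_idx] = 0.0
    return mask

# Hypothetical usage: 64 template tokens, 256 search tokens; in a deeper layer,
# the top-scoring search tokens (e.g. by attention to the template) are treated
# as target tokens.
scores = torch.rand(256)                 # placeholder attention scores
target_idx = scores.topk(64).indices     # assumed top-k partition of search tokens
mask = selective_flow_mask(64, 256, target_idx)
```

The mask would be added to the attention logits of each encoder layer; search-to-search and template-to-template interactions remain unrestricted in this sketch.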