DASTSiam: Spatio‐temporal fusion and discriminative enhancement for Siamese visual tracking

Yucheng Huang, Lijuan Zhu, Jihong Zhu, Eksan Firkat, Jinlai Zhang, Askar Hamdulla, Bin Zhu

doi:10.1049/cvi2.12213

Abstract

The use of deep neural networks has revolutionised object tracking, and Siamese trackers have emerged as a prominent technique for this task. Existing Siamese trackers use either a fixed template or a template-updating technique, but these approaches are prone to overfitting, lack the capacity to exploit global temporal sequences, and cannot utilise multi‐layer features. As a result, they struggle with dramatic appearance changes in complicated scenarios. Siamese trackers also struggle to learn background information, which impairs their discriminative ability. Hence, two transformer‐based modules, the Spatio‐Temporal Fusion (ST) module and the Discriminative Enhancement (DE) module, are proposed to improve the performance of Siamese trackers. The ST module leverages cross‐attention to accumulate global temporal cues and generates an attention matrix with ST similarity to enhance the template's adaptability to changes in target appearance. The DE module associates semantically similar points from the template and search area, thereby generating a learnable discriminative mask to enhance the discriminative ability of the Siamese trackers. In addition, a Multi‐Layer ST module (ST + ML) is constructed, which can be integrated into Siamese trackers based on multi‐layer cross‐correlation for further improvement. The authors evaluate the proposed modules on four public datasets and show how their performance compares with that of existing Siamese trackers.
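
To make the cross‐attention idea concrete, the following is a minimal NumPy sketch of generic scaled dot-product cross-attention, in which flattened template features act as queries over features gathered from several earlier frames. It is not the authors' ST module: the function names, feature shapes, residual connection, and the absence of learned projection weights are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(query, memory):
    """Generic scaled dot-product cross-attention (illustrative only).

    query:  (Nq, C) flattened template features (hypothetical shape).
    memory: (Nm, C) features stacked from earlier frames (hypothetical shape).
    Returns the fused features (Nq, C) and the attention matrix (Nq, Nm).
    """
    channels = query.shape[-1]
    # Similarity between every template position and every memory position.
    scores = query @ memory.T / np.sqrt(channels)
    # Each row is a distribution over memory positions.
    attn = softmax(scores, axis=-1)
    # Residual connection (an assumption): keep the original template
    # and add the temporally aggregated cues.
    fused = query + attn @ memory
    return fused, attn


rng = np.random.default_rng(0)
template = rng.standard_normal((16, 32))     # e.g. a 4x4 map with 32 channels
history = rng.standard_normal((3 * 16, 32))  # three earlier frames, same layout
fused, attn = cross_attention(template, history)
print(fused.shape, attn.shape)
```

Each row of `attn` weights all positions of all stored frames at once, which is one simple way for a template to draw on a whole temporal sequence rather than a single frame.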

Journal: IET Computer Vision
Publication Date: Jun 19, 2023
Citations: 1
License type: CC BY-NC-ND 4.0

Similar Papers

Discriminative Siamese Tracker Based on Multi-Channel-Aware and Adaptive Hierarchical Deep Features
Huanlong Zhang ... Fengxian Wang
Symmetry | VOL. 13
05 Dec 2021

SiamST: Siamese network with spatio-temporal awareness for object tracking
Hong Zhang ... Ding Yuan
Information Sciences | VOL. 634
13 Mar 2023

Learning Temporal-Correlated and Channel-Decorrelated Siamese Networks for Visual Tracking
Mao Xi ... Wengang Zhou
IEEE Transactions on Multimedia | VOL. 24
01 Jan 2021

Channel Attention Based Generative Network for Robust Visual Tracking
Ying Hu ... Hanyu Xuan
01 May 2020
