A ConvNext-Based and Feature Enhancement Anchor-Free Siamese Network for Visual Tracking

Qiguo Xu,Gang Liu,Xusheng Ruan,Zeyu Zhang,Honggui Deng,Yang Liu

doi:10.3390/electronics11152381

Qiguo Xu, Gang Liu + Show 4 more

Open Access

https://doi.org/10.3390/electronics11152381

Copy DOI

Journal: Electronics	Publication Date: Jul 29, 2022
Citations: 2	License type: CC BY 4.0

Affiliation: Central South University, Changsha Normal University

Abstract

Existing anchor-based Siamese trackers rely on the anchor’s design to predict the scale and aspect ratio of the target. However, these methods introduce many hyperparameters, leading to computational redundancy. In this paper, to achieve outstanding network efficiency, we propose a ConvNext-based anchor-free Siamese tracking network (CAFSN), which employs an anchor-free design to increase network flexibility and versatility. In CAFSN, to obtain an appropriate backbone network, the state-of-the-art ConvNext network is applied to the visual tracking field for the first time by improving the network stride and receptive field. Moreover, A central confidence branch based on Euclidean distance is offered to suppress low-quality prediction frames in the classification prediction network of CAFSN for robust visual tracking. In particular, we discuss that the Siamese network cannot establish a complete identification model for the tracking target and similar objects, which negatively impacts network performance. We build a Fusion network consisting of crop and 3Dmaxpooling to better distinguish the targets and similar objects’ abilities. This module uses 3DMaxpooling to select the highest activation value to improve the difference between it and other similar objects. Crop unifies the dimensions of different features and reduces the amount of computation. Ablation experiments demonstrate that this module increased success rates by 1.7% and precision by 0.5%. We evaluate CAFSN on challenging benchmarks such as OTB100, UAV123, and GOT-10K, validating advanced performance in noise immunity and similar target identification with 58.44 FPS in real time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A ConvNext-Based and Feature Enhancement Anchor-Free Siamese Network for Visual Tracking

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Visual Tracking with Attentional Convolutional Siamese Networks
Ke Tan ... Zhenzhong Wei
-
Ke Tan, et. al.Ke Tan ... Zhenzhong Wei
01 Jan 2019
01 Jan 2019

Two-stage aware attentional Siamese network for visual tracking
Xinglong Sun ... Qingqing Li
Pattern Recognition | VOL. 124
Xinglong Sun, et. al.Xinglong Sun ... Qingqing Li
21 Dec 2021
Pattern Recognition | VOL. 124

Ensemble learning with siamese networks for visual tracking
Junfei Zhuang ... Hongliang Bai
Neurocomputing | VOL. 464
Junfei Zhuang, et. al.Junfei Zhuang ... Hongliang Bai
11 Aug 2021
Neurocomputing | VOL. 464

Siamese Region Proposal Networks and Attention Module for Real-time Visual Tracking
Hang Dong ... Yuan Zeng
-
Hang Dong, et. al.Hang Dong ... Yuan Zeng
25 Dec 2020
25 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A ConvNext-Based and Feature Enhancement Anchor-Free Siamese Network for Visual Tracking

Abstract

Talk to us

Similar Papers

More From: Electronics