VLFSE: Enhancing visual tracking through visual language fusion and state update evaluator

Fuchao Yang,Mingkai Jiang,Qiaohong Hao,Xiaolei Zhao,Qinghe Feng

doi:10.1016/j.mlwa.2024.100588

Fuchao Yang, Mingkai Jiang + Show 3 more

Open Access

https://doi.org/10.1016/j.mlwa.2024.100588

Copy DOI

Export

Save

Cite

Journal: Machine Learning with Applications	Publication Date: Sep 30, 2024
License type: cc-by-nc-nd

Abstract
Full-Text
Similar Papers

Abstract

Listen

Recently, visual tracking algorithms have achieved impressive results by combining dynamic templates. However, the instability of visual images and the incorrect timing of template updates lead to decreased tracking accuracy and stability in intricate scenarios. To address these issues, we propose a visual tracking algorithm through visual language fusion and a state update evaluator (VLFSE). Specifically, our approach introduces a multimodal attention mechanism that uses self-attention to mine and integrate information from diverse sources effectively. This mechanism ensures a richer, context-aware representation of the target, enabling more accurate tracking even in complex scenes. Moreover, we recognize the critical need for precise template updates to maintain tracking accuracy over time. To this end, we develop a state update evaluator, a component trained online to assess the necessity and timing of template updates accurately. This evaluator acts as a safeguard, preventing erroneous updates and ensuring the tracker adapts optimally to changes in the target’s appearance. The experimental results on challenging visual language tracking datasets demonstrate our tracker’s superior performance, showcasing its adaptability and accuracy in complex tracking scenarios.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

VLFSE: Enhancing visual tracking through visual language fusion and state update evaluator

Abstract

Published Version

Talk to us

Similar Papers

More From: Machine Learning with Applications

Lead the way for us

Similar Papers

Performance Evaluation of Visual Tracking Algorithms on Video Sequences With Quality Degradation
Yuming Fang ... Weisi Lin
IEEE Access | VOL. 5
Yuming Fang, et. al.Yuming Fang ... Weisi Lin
01 Jan 2017
IEEE Access | VOL. 5

Research on the improvement of vision target tracking algorithm for Internet of things technology and Simple extended application in pellet ore phase
Jie Li ... Aimin Yang
Future Generation Computer Systems | VOL. 110
Jie Li, et. al.Jie Li ... Aimin Yang
13 Apr 2020
Future Generation Computer Systems | VOL. 110

Surgical Navigation System Based on the Visual Object Tracking Algorithm
Yan Pei-Lun ... Hua Chun-Sheng
-
Yan Pei-Lun, et. al.Yan Pei-Lun ... Hua Chun-Sheng
01 Apr 2018
01 Apr 2018

Visual Tracking Based on Complementary Learners with Distractor Handling
Suryo Adhi Wibowo ... Sungshin Kim
Mathematical Problems in Engineering | VOL. 2017
Suryo Adhi Wibowo, et. al.Suryo Adhi Wibowo ... Sungshin Kim
01 Jan 2017
Mathematical Problems in Engineering | VOL. 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

VLFSE: Enhancing visual tracking through visual language fusion and state update evaluator

Abstract

Published Version

Talk to us

Similar Papers

More From: Machine Learning with Applications