Abstract

Some tracking algorithms based on Siamese network have made great progress in similarity learning via features cross-correlation between an object branch and a search branch. However, it is significantly challenging for object tracking in video sequences in terms of target deformation with greatly varying. We propose a Siamese network based on global and local feature matching for object tracking including three phases with the aim of addressing the above issues. In the first phase, obtaining the global similarity matching and local relational mapping similarity of the template branch and the search branch by a selection mechanism of object template-aware features are to reduce the impact of background features on the local matching. In the second phase, introducing correlation matching of the local feature for establishing correspondence among partial-level pixels. Finally, combining the classification and regression results with global matching features and local matching features in a weighted fusion. Extensive experiments are conducted on datasets (OTB-100, LaSOT and GOT-10K) demonstrate that the proposed network enables to achieve superiority compared against the state-of-the-art method and provides an efficient scenario for tackling the issue.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call