Abstract

Visual object tracking has been an active research topic in recent years, and many trackers have achieved good results in a variety of settings. This research has brought steady progress on problems such as drift, illumination change, deformation, and occlusion. In this paper, we improve the structure of the AlexNet[1] network by adjusting three important factors of the Siamese (twin) network: the receptive field size, the total network stride, and the feature padding. In addition, we add a smoothing matrix and a background suppression matrix so that the features of the first few frames are learned as fully as possible. The fused multilayer feature elements learn target appearance changes and background suppression online, and we train them on continuous video sequences.
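To make the first two factors concrete, the following minimal sketch computes the receptive field size, total stride, and unpadded output size of a stack of convolution and pooling layers. The layer list is an assumption for illustration: it follows the AlexNet-style backbone commonly used in Siamese trackers, not the modified network proposed in this paper, and the names `LAYERS`, `receptive_field_and_stride`, and `output_size` are hypothetical.

```python
from typing import List, Tuple

# Each entry is (kernel_size, stride).
# ASSUMPTION: an AlexNet-style Siamese backbone, used only for illustration;
# this is not the modified architecture described in the abstract.
LAYERS: List[Tuple[int, int]] = [
    (11, 2),  # conv1
    (3, 2),   # pool1
    (5, 1),   # conv2
    (3, 2),   # pool2
    (3, 1),   # conv3
    (3, 1),   # conv4
    (3, 1),   # conv5
]


def receptive_field_and_stride(layers: List[Tuple[int, int]]) -> Tuple[int, int]:
    """Return (receptive field size, total stride) of the last layer's output."""
    rf, jump = 1, 1  # jump = distance in input pixels between adjacent outputs
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each layer widens the field by (k - 1) jumps
        jump *= stride             # strides multiply through the stack
    return rf, jump


def output_size(input_size: int, layers: List[Tuple[int, int]]) -> int:
    """Spatial size of the final feature map when no padding is applied."""
    size = input_size
    for kernel, stride in layers:
        size = (size - kernel) // stride + 1
    return size


if __name__ == "__main__":
    rf, stride = receptive_field_and_stride(LAYERS)
    print(f"receptive field: {rf}, total stride: {stride}")
    print(f"feature map size for a 127-pixel input: {output_size(127, LAYERS)}")
```

Changing a kernel size or stride in the list shows how the three quantities trade off: a larger total stride shrinks the feature map and coarsens localization, while removing padding keeps the features free of border artifacts at the cost of a smaller output.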
