Iterative Multiple Bounding-Box Refinements for Visual Tracking.

Giorgio Cruciata, Marco La Cascia, Liliana Lo Presti

doi:10.3390/jimaging8030061

Abstract

Single-object visual tracking aims at locating a target in each video frame by predicting the bounding box of the object. Recent approaches have adopted iterative procedures to gradually refine the bounding box and locate the target in the image. In such approaches, the deep model takes as input the image patch corresponding to the currently estimated target bounding box, and outputs the probability associated with each of the possible bounding box refinements, generally defined as a discrete set of linear transformations of the bounding box center and size. At each iteration, only one transformation is applied, and supervised training of the model may introduce an inherent ambiguity by prioritizing some transformations over others. This paper proposes a novel formulation of the problem of selecting the bounding box refinement. It introduces the concept of non-conflicting transformations and allows multiple refinements to be applied to the target bounding box at each iteration without introducing ambiguities while learning the model parameters. Empirical results demonstrate that the proposed approach improves on iterative single refinement in terms of the accuracy and precision of the tracking results.

Highlights

The identity transformation is included to account for the cases in which the bounding box must be accepted as it is. To implement such a strategy, the deep model takes as input the image patch corresponding to the currently estimated target bounding box, and outputs the probability associated with each of the possible bounding box refinements.
We aim to study the effect of formulating the problem of selecting the best target bounding box refinements in a different way.
This work focuses on tracking strategies where the target bounding box is refined iteratively by applying a sequence of transformations.
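
The difference between applying one refinement per iteration and applying several can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the action set, the step fraction ALPHA, the grouping of actions into three conflict groups (horizontal shift, vertical shift, scale), and the toy oracle that stands in for the deep model by scoring each action with the error reduction it would produce.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    cx: float
    cy: float
    w: float
    h: float


ALPHA = 0.05  # illustrative step fraction, not a value from the paper

# action -> (dx, dy, ds): unit changes of centre x, centre y and scale.
# "identity" comes first so that it wins ties and stops the loop.
DELTAS = {
    "identity": (0, 0, 0),
    "left": (-1, 0, 0), "right": (1, 0, 0),
    "up": (0, -1, 0), "down": (0, 1, 0),
    "smaller": (0, 0, -1), "bigger": (0, 0, 1),
}
# Assumption: two actions conflict when they act on the same quantity.
GROUPS = (("left", "right"), ("up", "down"), ("smaller", "bigger"))


def transform(box, dx, dy, ds):
    """Linear change of centre and size, proportional to the box size."""
    return Box(box.cx + dx * ALPHA * box.w, box.cy + dy * ALPHA * box.h,
               box.w * (1 + ds * ALPHA), box.h * (1 + ds * ALPHA))


def refine_single(box, score_fn, max_iters=50):
    """Apply only the best-scoring action per iteration; identity stops."""
    steps = 0
    for _ in range(max_iters):
        scores = score_fn(box)
        best = max(scores, key=scores.get)
        if best == "identity":
            break
        box = transform(box, *DELTAS[best])
        steps += 1
    return box, steps


def refine_multiple(box, score_fn, max_iters=50):
    """Apply at most one action per conflict group in each iteration."""
    steps = 0
    for _ in range(max_iters):
        scores = score_fn(box)
        chosen = []
        for group in GROUPS:
            best = max(group, key=scores.get)
            if scores[best] > scores["identity"]:
                chosen.append(best)
        if not chosen:  # every group prefers to leave the box as it is
            break
        dx, dy, ds = (sum(d) for d in zip(*(DELTAS[a] for a in chosen)))
        box = transform(box, dx, dy, ds)
        steps += 1
    return box, steps


def l1_error(box, target):
    return (abs(box.cx - target.cx) + abs(box.cy - target.cy)
            + abs(box.w - target.w) + abs(box.h - target.h))


def make_oracle(target):
    """Toy stand-in for the model: score = error reduction of each action."""
    def score_fn(box):
        base = l1_error(box, target)
        return {a: base - l1_error(transform(box, *d), target)
                for a, d in DELTAS.items()}
    return score_fn


target = Box(62, 44, 50, 50)
start = Box(50, 50, 40, 40)
oracle = make_oracle(target)
single_box, single_steps = refine_single(start, oracle)
multi_box, multi_steps = refine_multiple(start, oracle)
print(single_steps, multi_steps)
```

On this toy input both loops end close to the target, and the multiple-refinement loop needs fewer iterations because a shift and a rescale can happen in the same step. A trained model would output probabilities rather than the oracle's error reductions.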

Summary

Introduction

Visual object tracking aims to automatically locate a target in subsequent frames, generally by estimating the bounding box that encloses the target on the image plane [1]. In contrast to the object detection problem, where instances of predefined object classes are located in an image, in object tracking the target is often located in a class-agnostic way by considering only the information provided in an initial frame (for instance, the frame where the target first appears). Although it has been widely studied, visual tracking remains a challenging problem in real-world scenarios due to target occlusions, pose and appearance changes, and illumination variations [2]. MDNet has two main limitations: one is the need to sample and classify several bounding boxes at each frame to select the optimal one; the other is the use of a regression model to refine the selected bounding box.
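
The sample-and-classify step attributed to MDNet above can be sketched as follows, to show why it costs one classifier evaluation per candidate at every frame. This is a hedged illustration, not MDNet's implementation: the Gaussian jitter parameters, the candidate count, and the function names are assumptions, and the classifier is replaced by a stand-in that scores a candidate by its overlap with a known box.

```python
import random


def sample_candidates(box, n=256, pos_sigma=0.1, scale_sigma=0.05, rng=None):
    """Draw n candidate boxes (cx, cy, w, h) around `box` with Gaussian jitter.

    The sigmas and n are illustrative assumptions.
    """
    rng = rng or random.Random(0)
    cx, cy, w, h = box
    candidates = []
    for _ in range(n):
        s = max(1.0 + rng.gauss(0.0, scale_sigma), 0.1)  # keep sizes positive
        candidates.append((cx + rng.gauss(0.0, pos_sigma) * w,
                           cy + rng.gauss(0.0, pos_sigma) * h,
                           w * s, h * s))
    return candidates


def track_by_sampling(prev_box, classify, n=256, rng=None):
    """Score every sampled candidate and keep the best one."""
    candidates = sample_candidates(prev_box, n=n, rng=rng)
    return max(candidates, key=classify)  # n classifier calls per frame


def iou(a, b):
    """Intersection over union of two (cx, cy, w, h) boxes."""
    iw = max(0.0, min(a[0] + a[2] / 2, b[0] + b[2] / 2)
             - max(a[0] - a[2] / 2, b[0] - b[2] / 2))
    ih = max(0.0, min(a[1] + a[3] / 2, b[1] + b[3] / 2)
             - max(a[1] - a[3] / 2, b[1] - b[3] / 2))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


prev_box = (50.0, 50.0, 40.0, 40.0)
true_box = (56.0, 47.0, 42.0, 42.0)  # known only to the stand-in classifier


def classify(candidate):
    # Stand-in for a learned target/background classifier.
    return iou(candidate, true_box)


best = track_by_sampling(prev_box, classify, rng=random.Random(0))
print(best, classify(best))
```

An iterative refinement tracker replaces this per-frame search over many candidates with a short sequence of transformations applied to a single box.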

Journal: Journal of Imaging
Publication Date: Mar 3, 2022
Citations: 2
License type: CC BY 4.0

Similar Papers

Accurate and robust visual tracking using bounding box refinement and online sample filtering
Yijin Yang ... Xiaodong Gu
Signal Processing: Image Communication | VOL. 116
22 Apr 2023

Drawing and Analysis of Bounding Boxes for Object Detection with Anchor-Based Models
Manav Madan ... Christoph Reich
01 Jan 2023

Fast Visual Object Tracking using Ellipse Fitting for Rotated Bounding Boxes
Bao Xin Chen ... John Tsotsos
01 Oct 2019

Multi target pigs tracking loss correction algorithm based on Faster R-CNN
...
International Journal of Agricultural and Biological Engineering | VOL. 11
29 Sep 2018
