Geometric bounding box interpolation: an alternative for efficient video annotation

Pedro Gil-Jiménez, Roberto López-Sastre, Saturnino Maldonado-Bascón, Hilario Gómez-Moreno

doi:10.1186/s13640-016-0108-7


Open Access

https://doi.org/10.1186/s13640-016-0108-7


Abstract

In video annotation, instead of annotating every frame of a trajectory, the user usually provides only a sparse set of annotations, typically the endpoints plus some key intermediate frames; the remaining annotations are interpolated between these key frames in order to reduce the cost of video labeling. While a number of video annotation tools have been proposed, some of which are freely available, bounding box interpolation is mainly based on image processing techniques whose performance is highly dependent on image quality, occlusions, etc. We propose an alternative method to interpolate bounding box annotations, based on cubic splines and the geometric properties of the elements involved, rather than image processing techniques. The proposed algorithm is compared with other bounding box interpolation methods described in the literature, using a set of selected videos modeling different types of object and camera motion. Experiments show that the accuracy of the interpolated bounding boxes is higher than that of the other evaluated methods, especially when considering rigid objects. The main goal of this paper concerns the bounding box interpolation step, and we believe that our design can be integrated seamlessly with any annotation tool already developed.

Highlights

The growth in image and video processing demands larger quantities of annotated training data.
In order to test the interpolation schema described in this paper, the proposed algorithm has been implemented in Python, using OpenCV.
For the cubic spline interpolation, we used the functions implemented in the SciPy library [19].
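
As a rough illustration of cubic spline interpolation with SciPy, the sketch below fits one spline per bounding box coordinate across a few key frames. It is a minimal sketch, not the paper's geometric method: it interpolates image-plane coordinates directly, and the key-frame indices, box values, and the helper name `interpolate_box` are all hypothetical.

```python
import numpy as np
from scipy.interpolate import CubicSpline

# Hypothetical key-frame annotations (assumed values, not from the paper):
# frame index -> (x_min, y_min, x_max, y_max) in pixels.
key_frames = np.array([0, 10, 20, 30])
key_boxes = np.array([
    [100.0, 80.0, 160.0, 140.0],
    [130.0, 85.0, 195.0, 150.0],
    [170.0, 95.0, 240.0, 165.0],
    [220.0, 110.0, 295.0, 190.0],
])

# axis=0 treats each row as one time sample, so the four box
# coordinates are interpolated independently of each other.
spline = CubicSpline(key_frames, key_boxes, axis=0)


def interpolate_box(frame):
    """Return the interpolated (x_min, y_min, x_max, y_max) at a frame index."""
    return spline(frame)


# Fill in every frame between the first and last key frame.
all_frames = np.arange(key_frames[0], key_frames[-1] + 1)
boxes = spline(all_frames)  # one row per frame, four columns
```

The spline passes exactly through the annotated key frames, so only the frames in between are estimated.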

Summary

Introduction

The growth in image and video processing demands larger quantities of annotated training data. Regarding linear interpolation of annotations between key frames, Yuen et al. [14] pointed out that ‘a constant velocity in space does not project to a constant velocity in the image plane, due to perspective effects.’ In their work, they proposed an alternative method to project the coordinates of the annotated object onto the image plane for any time between two key frames. This improvement allows us to use cubic splines, instead of linear interpolation, to interpolate the spatial bounding box coordinates and model object trajectories.
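
The perspective effect quoted above can be checked numerically. The following sketch assumes a simple pinhole camera (image coordinate u = f·X/Z) and made-up values for the focal length and the object path; it is only an illustration of the effect, not the projection method of Yuen et al. or of this paper.

```python
import numpy as np

# Assumed pinhole camera: image coordinate u = f * X / Z.
f = 800.0  # hypothetical focal length in pixels

# A point at a fixed lateral offset approaches the camera at constant speed.
t = np.linspace(0.0, 1.0, 5)   # normalized time between two key frames
X = np.full_like(t, 1.0)       # lateral position (constant)
Z = 20.0 + (4.0 - 20.0) * t    # depth decreases linearly from 20 to 4

# True image-plane positions obtained by projecting the 3D path.
u_true = f * X / Z

# Linear interpolation between the two projected endpoints.
u_linear = u_true[0] + (u_true[-1] - u_true[0]) * t

# Per-step image displacement: constant for u_linear, growing for u_true.
step_true = np.diff(u_true)
step_linear = np.diff(u_linear)
error = np.abs(u_linear - u_true)
```

With these assumed numbers, the image-plane steps of the true projection grow as the point approaches the camera, while linear interpolation takes equal steps; at the midpoint the two differ by roughly 53 pixels.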

Reconstruction for a single segment

Reconstruction from multiple segments

Results and discussion


Journal: EURASIP Journal on Image and Video Processing
Publication Date: Feb 22, 2016
Citations: 10
License type: CC BY 4.0


Similar Papers


Towards Extracting Semantically Meaningful Key Frames From Personal Video Clips: From Humans to Computers
Jiebo Luo ... C Papin
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 19
01 Feb 2009

Reinforcing Web-object Categorization Through Interrelationships
Gui-Rong Xue ... Dou Shen
Data Mining and Knowledge Discovery | VOL. 12
04 Apr 2006

Efficient video annotation with visual interpolation and frame selection guidance
Alina Kuznetsova ... Keith Simmons
01 Jan 2020

Efficient processing of neighboring skyline queries with consideration of distance, quality, and cost
Yuan-Ko Huang
Computing | VOL. 102
27 Nov 2019
