Coding visual features extracted from video sequences.

Luca Baroffio,Matteo Cesana,Stefano Tubaro,Marco Tagliasacchi,Alessandro Redondi

doi:10.1109/tip.2014.2312617

Abstract

Visual features are successfully exploited in several applications (e.g., visual search, object recognition and tracking, etc.) due to their ability to efficiently represent image content. Several visual analysis tasks require features to be transmitted over a bandwidth-limited network, thus calling for coding techniques to reduce the required bit budget, while attaining a target level of efficiency. In this paper, we propose, for the first time, a coding architecture designed for local features (e.g., SIFT, SURF) extracted from video sequences. To achieve high coding efficiency, we exploit both spatial and temporal redundancy by means of intraframe and interframe coding modes. In addition, we propose a coding mode decision based on rate-distortion optimization. The proposed coding scheme can be conveniently adopted to implement the analyze-then-compress (ATC) paradigm in the context of visual sensor networks. That is, sets of visual features are extracted from video frames, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast to the traditional compress-then-analyze (CTA) paradigm, in which video sequences acquired at a node are compressed and then sent to a central unit for further processing. In this paper, we compare these coding paradigms using metrics that are routinely adopted to evaluate the suitability of visual features in the context of content-based retrieval, object recognition, and tracking. Experimental results demonstrate that, thanks to the significant coding gains achieved by the proposed coding scheme, ATC outperforms CTA with respect to all evaluation metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society	Publication Date: May 1, 2014
Citations: 92	License type: cc-by

R Discovery Prime

R Discovery Prime

Coding visual features extracted from video sequences.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

Lead the way for us

Similar Papers

Coding Local and Global Binary Visual Features Extracted From Video Sequences.
Luca Baroffio ... Marco Tagliasacchi
IEEE Transactions on Image Processing | VOL. 24
Luca Baroffio, et. al.Luca Baroffio ... Marco Tagliasacchi
12 Jun 2015
IEEE Transactions on Image Processing | VOL. 24

Coding binary local features extracted from video sequences
Luca Baroffio ... Marco Tagliasacchi
-
Luca Baroffio, et. al.Luca Baroffio ... Marco Tagliasacchi
01 Oct 2014
01 Oct 2014

Coding video sequences of visual features
Luca Baroffio ... Alessandro Redondi
-
Luca Baroffio, et. al.Luca Baroffio ... Alessandro Redondi
01 Sep 2013
01 Sep 2013

Extraction of Visual Features from Video Sequences for Better Visual Analysis
Prachi Rohit Rajarapollu ... Vijay R Mankar
-
Prachi Rohit Rajarapollu, et. al.Prachi Rohit Rajarapollu ... Vijay R Mankar
01 Aug 2018
01 Aug 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Coding visual features extracted from video sequences.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society