Abstract

In this paper, we describe the high-performance implementation of an optical-flow algorithm that takes advantage of the processor's architecture. Tuning the code, i.e., adapting it to take full advantage of the processor, is challenging and time consuming and requires efficient programming at several levels, but it can lead to significant performance improvements. The optimized implementation presented here is of high interest for a number of applications, since it delivers real-time motion estimation at high image resolutions on a PC or in an embedded system based on a general-purpose processor. On a 2.83 GHz Core 2 Quad PC, it achieves a speedup of 14 over our first code version and 2052.7 f/s for the well-known 252 × 316 Yosemite sequence, and a speedup of 17.6 and 68.5 f/s for a 1016 × 1280 sequence. The description of how this high performance is achieved, however, goes beyond a specific application, since this paper illustrates how inherently dense, low-level visual algorithms (pixel-wise computation) can be structured and improved to take full advantage of a standard processor. The implementation is compared with other hardware (based on FPGAs and GPUs) and software (based on clusters, PCs, and special-purpose processors) optical-flow implementations, showing that it outperforms them.
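To make the idea of restructuring pixel-wise computation for a standard processor concrete, the sketch below shows the general style of multi-level tuning the abstract refers to: SIMD vectorization (SSE intrinsics) combined with multicore parallelism (OpenMP). It is not the paper's code; the function names and the temporal-gradient example are illustrative assumptions only.

```cpp
// Minimal sketch (not the paper's implementation): a pixel-wise temporal
// gradient I_t = I2 - I1, first as a naive scalar loop, then restructured
// with SSE intrinsics and OpenMP threads -- the kind of multi-level tuning
// (vectorization plus multicore) the abstract alludes to.
#include <emmintrin.h>   // SSE2 intrinsics
#include <cstddef>

// Naive, unoptimized version: one pixel per iteration.
void temporal_gradient_scalar(const float* i1, const float* i2,
                              float* it, std::size_t n) {
    for (std::size_t p = 0; p < n; ++p)
        it[p] = i2[p] - i1[p];
}

// Tuned version: 4 pixels per SSE operation, iterations split across
// cores with OpenMP. n is the total pixel count of the frame.
void temporal_gradient_tuned(const float* i1, const float* i2,
                             float* it, std::size_t n) {
    const long long vec_end = static_cast<long long>(n - (n % 4));
    #pragma omp parallel for
    for (long long p = 0; p < vec_end; p += 4) {
        __m128 a = _mm_loadu_ps(i2 + p);   // load 4 pixels of frame 2
        __m128 b = _mm_loadu_ps(i1 + p);   // load 4 pixels of frame 1
        _mm_storeu_ps(it + p, _mm_sub_ps(a, b));
    }
    for (std::size_t p = vec_end; p < n; ++p)  // scalar tail
        it[p] = i2[p] - i1[p];
}
```

Compiled with vectorization and OpenMP enabled (e.g., `g++ -O2 -msse2 -fopenmp`), the tuned loop processes four pixels per instruction on each core, which is the basic mechanism behind the kind of speedups reported in the abstract.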
