Scalable software architecture for high performance video codec's on parallel processing engines

Krishnakanth Rapaka,Keshava Prasad,Mihir Mody

doi:10.1109/isce.2007.4382148

Abstract

Video algorithm (e.g. H.264, MPEG2/4 etc) requires tremendous amount of computation power and data bandwidth. This complexity depends on encoding vs. decoding mode, video standard, resolution, frame-rate and visual quality constraints. Many video architecture solutions typically use multiple processing elements (e.g. multiple DSPs or MCU, DSP/MCU with dedicated accelerators or FPGA etc) to achieve the high computation requirements for video algorithms. These architectures provide new challenges to video software's that are typically designed to run on a single processor. This paper presents software design for a video architecture using parallel processing elements. This paper explains following aspects in detail a) Software partitioning b) Algorithm specific optimizations c) Processor specific optimizations d) Efficient DMA/Cache usage e) Concurrent scheduling of all parallel processing elements. The given approach is explained with example of MPEG4 encoder on TMS320DM6446, which is Davincitrade family device from Texas Instruments Ltd. The given software architecture is scalable for various video standards (e.g. H.264, MPEG2/4 etc) as well as various parallel processing hardware solutions. The software achieves performance Dl@30 fsp on given device at less than 50% of DSP load.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scalable software architecture for high performance video codec's on parallel processing engines

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

<title>Column parallel vision system: CPV</title>
Naohisa Mukozaka ... Haruyoshi Toyoda
-
Naohisa Mukozaka, et. al.Naohisa Mukozaka ... Haruyoshi Toyoda
26 Apr 2002
26 Apr 2002

Bounds on Multiprocessing Timing Anomalies
R L Graham
SIAM Journal on Applied Mathematics | VOL. 17
R L GrahamR L Graham
01 Mar 1969
SIAM Journal on Applied Mathematics | VOL. 17

An efficient VLSI architecture for lifting based 1D/2D discrete wavelet transform
Mohamed Asan Basiri M ... Noor Mahammad Sk
Microprocessors and Microsystems | VOL. 47
Mohamed Asan Basiri M, et. al.Mohamed Asan Basiri M ... Noor Mahammad Sk
31 Aug 2016
Microprocessors and Microsystems | VOL. 47

A Viterbi decoder architecture based on parallel processing elements
S.R Meier
-
S.R MeierS.R Meier
02 Dec 1990
02 Dec 1990

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scalable software architecture for high performance video codec's on parallel processing engines

Abstract

Talk to us

Similar Papers