A generic mid-level representation for semantic video analysis

Qing Tang Qing Tang,Haiping Sun Haiping Sun,J.S Jin,Qi Tian Qi Tian,Joo-Hwee Lim Joo-Hwee Lim

doi:10.1109/icip.2004.1418833

A generic mid-level representation for semantic video analysis

Qing Tang Qing Tang, Haiping Sun Haiping Sun + Show 3 more

https://doi.org/10.1109/icip.2004.1418833

Copy DOI

Publication Date: Oct 24, 2004

Citations: 4

Affiliation: University of Sydney, Institute for Infocomm Research

#Semantic Analysis #Semantic Video Analysis + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The paper presents a generic, mid-level representation for efficient semantic video analysis, which adopts a frame-by-frame scheme using P-frames rather than shot-based schemes. Each P-frame is partitioned into an m/spl times/n grid (row by column), and each cell is called a 'block'. The representation can bridge the semantic gap and build an intermediate description of video features across frames and blocks. Soccer video is used to showcase the potential of the framework for real video processing. Experiments with tennis video and news video have also been conducted. Results demonstrate the excellent performance of the framework in semantic analysis and also indicate its further potential for automatic video analysis.

Full Text