Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

P Sidiropoulos,V Mezaris,I Trancoso,M Bugalho,H Meinedo,I Kompatsiaris

doi:10.1109/tcsvt.2011.2138830

Abstract

In this paper, a novel approach to video temporal decomposition into semantic units, termed scenes, is presented. In contrast to previous temporal segmentation approaches that employ mostly low-level visual or audiovisual features, we introduce a technique that jointly exploits low-level and high-level features automatically extracted from the visual and the auditory channel. This technique is built upon the well-known method of the scene transition graph (STG), first by introducing a new STG approximation that features reduced computational cost, and then by extending the unimodal STG-based temporal segmentation technique to a method for multimodal scene segmentation. The latter exploits, among others, the results of a large number of TRECVID-type trained visual concept detectors and audio event detectors, and is based on a probabilistic merging process that combines multiple individual STGs while at the same time diminishing the need for selecting and fine-tuning several STG construction parameters. The proposed approach is evaluated on three test datasets, comprising TRECVID documentary films, movies, and news-related videos, respectively. The experimental results demonstrate the improved performance of the proposed approach in comparison to other unimodal and multimodal techniques of the relevant literature and highlight the contribution of high-level audiovisual features toward improved video segmentation to scenes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Aug 1, 2011
Citations: 179

Similar Papers

Multi-modal scene segmentation using scene transition graphs
Panagiotis Sidiropoulos ... Vasileios Mezaris
-
Panagiotis Sidiropoulos, et. al.Panagiotis Sidiropoulos ... Vasileios Mezaris
19 Oct 2009
19 Oct 2009

Image Feature Types and Their Predictions of Aesthetic Preference and Naturalness.
Frank F Ibarra ... Hiroki P Kotabe
Frontiers in Psychology | VOL. 8
Frank F Ibarra, et. al.Frank F Ibarra ... Hiroki P Kotabe
28 Apr 2017
Frontiers in Psychology | VOL. 8

On the Use of Audio Events for Improving Video Scene Segmentation
Panagiotis Sidiropoulos ... Hugo Meinedo
-
Panagiotis Sidiropoulos, et. al.Panagiotis Sidiropoulos ... Hugo Meinedo
08 Aug 2012
08 Aug 2012

Content Based Image Retrieval
Showkat Dar
IOSR Journal of Computer Engineering | VOL. 12
Showkat DarShowkat Dar
01 Jan 2013
IOSR Journal of Computer Engineering | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology