Abstract

Effective parsing of video across the spatial and temporal domains is vital to many computer vision problems, since it enables objects in video to be labeled automatically rather than through tedious manual annotation. Some prior work parses semantic information on individual 2D images or individual video frames; however, these approaches exploit only spatial information, ignoring temporal continuity and the relevance between frames. Other approaches attempt to propagate labels in the temporal domain to parse the semantic information of the whole video, yet the non-injective and non-surjective nature of inter-frame correspondences can cause the black hole effect. In this paper, inspired by existing annotated image datasets (e.g., the Stanford Background Dataset, LabelMe, and SIFT Flow), we propose to transfer and propagate such labels from images to videos. The proposed approach consists of three main stages: (I) the posterior category probability density function (PDF) is learned by an algorithm that combines frame relevance with label propagation from images; (II) the prior contextual-constraint PDF over the map of pixel categories throughout the whole video is learned with a Markov random field (MRF); (III) based on both learned PDFs, the final parsing results are obtained by maximum a posteriori (MAP) inference, computed with an efficient graph-cut-based integer optimization algorithm. Experiments show that the proposed approach effectively handles the black hole effect.
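To make the MAP inference step (stage III) concrete, the following is a minimal sketch in Python, assuming a binary (two-category) simplification in which a single s-t graph cut is exact; the paper's multi-label setting would instead use an alpha-expansion scheme over the same energy. The sketch uses the PyMaxflow library, and the names unary_fg, unary_bg, lam, and map_parse_binary are illustrative, not from the paper: the unary terms stand in for the negative-log posterior category PDFs of stage I, and lam for the strength of the MRF contextual prior of stage II.

    # A per-frame, two-label sketch of graph-cut MAP inference (assumption:
    # the paper's multi-label case would use alpha-expansion instead).
    import numpy as np
    import maxflow  # pip install PyMaxflow

    def map_parse_binary(unary_fg, unary_bg, lam=1.0):
        """MAP labeling of one frame by a single s-t minimum cut.

        unary_fg, unary_bg: (H, W) arrays of -log posterior category PDFs.
        lam: weight of the Potts-style MRF smoothness (contextual) prior.
        Returns a boolean (H, W) map, True where the MAP label is "foreground".
        """
        g = maxflow.Graph[float]()
        nodes = g.add_grid_nodes(unary_fg.shape)
        # Pairwise Potts terms on the 4-connected grid encode the MRF prior.
        g.add_grid_edges(nodes, lam)
        # Terminal edges encode the per-pixel data (posterior) costs.
        g.add_grid_tedges(nodes, unary_fg, unary_bg)
        g.maxflow()
        # Nodes on the sink side of the minimum cut take the foreground label.
        return g.get_grid_segments(nodes)

    # Toy usage: noisy foreground posteriors for a 4x4 frame.
    rng = np.random.default_rng(0)
    p_fg = np.clip(rng.random((4, 4)), 1e-6, 1 - 1e-6)
    labels = map_parse_binary(-np.log(p_fg), -np.log(1 - p_fg), lam=0.5)

Raising lam trades per-pixel posterior evidence for spatial coherence; in the multi-label case the same unary-plus-Potts energy is minimized approximately by cycling alpha-expansion moves, each of which reduces to a binary cut of this form.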
