Abstract

A framework for video semantic analysis and understanding based on a higher-order tensor representation is proposed. In this framework, each video shot is first represented as a third-order tensor built from the multimodal data it contains: text, visual, and auditory streams. Next, exploiting this third-order tensor representation and the temporal-sequenced co-occurrence of the multimodal data in video, a subspace embedding and dimension reduction method, called TensorShot, is designed. Since transductive learning starts from labeled samples and learns to recognize specific unlabeled samples, a transductive support tensor machine algorithm based on TensorShot is then proposed within this framework. The algorithm preserves the intrinsic structure of the manifold on which the TensorShots lie, maps out-of-sample data directly into the manifold subspace, and makes full use of unlabeled samples to improve the classifier's learning performance. Experimental results show that the proposed method effectively detects semantic concepts in video shots.
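For illustration only, the sketch below shows one plausible way to assemble a video shot into a third-order tensor and reduce it with a generic multilinear (Tucker-style) projection. The array sizes, the random placeholder features, and the projection routine are all assumptions for exposition; the paper's TensorShot embedding additionally preserves the manifold structure induced by temporally associated shots, and the transductive support tensor machine classifier is not shown here.

```python
# Hypothetical sketch: a video shot as a 3rd-order tensor (frames x features x
# modalities) and a simple multilinear projection for dimension reduction.
# This is NOT the paper's TensorShot method; it only illustrates the data layout.
import numpy as np

n_frames, n_feat, n_modal = 30, 64, 3            # assumed sizes
shot = np.random.rand(n_frames, n_feat, n_modal)  # placeholder text/visual/audio features

def mode_unfold(T, mode):
    """Unfold tensor T along the given mode into a matrix."""
    return np.moveaxis(T, mode, 0).reshape(T.shape[mode], -1)

def multilinear_project(T, ranks):
    """Project each mode onto its leading left singular vectors (Tucker-style)."""
    core = T
    for mode, r in enumerate(ranks):
        U, _, _ = np.linalg.svd(mode_unfold(core, mode), full_matrices=False)
        core = np.moveaxis(np.tensordot(U[:, :r].T, core, axes=(1, mode)), 0, mode)
    return core

low_dim = multilinear_project(shot, ranks=(10, 16, 3))
print(low_dim.shape)  # (10, 16, 3): reduced tensor representation of the shot
```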
