Abstract

The multimedia content analysis community has made significant efforts to bridge the gap between low-level features and the high-level semantics perceived by the human cognitive system, such as real-world objects and concepts. Both low-level features and high-level semantics are extensively studied in the two fields of multimedia analysis and brain imaging. For instance, in multimedia analysis, many algorithms are available for feature extraction, along with benchmark datasets such as TRECVID. In brain imaging, the brain regions responsible for vision, auditory perception, language, and working memory have been well characterized via functional magnetic resonance imaging (fMRI). This paper presents our initial effort to marry these two fields in order to bridge the gap between low-level features and high-level semantics via fMRI brain imaging. In our experimental paradigm, we performed fMRI brain imaging while university student subjects watched video clips selected from the TRECVID datasets. At the current stage, we focus on the three concepts of sports, weather, and commercial/advertisement specified in TRECVID 2005. Meanwhile, the brain regions in the vision, auditory, language, and working memory networks are quantitatively localized and mapped via task-based fMRI paradigms, and the fMRI responses in these regions are used to extract features representing the brain's comprehension of semantics. Our computational framework aims to learn the low-level feature sets that best correlate with the fMRI-derived semantics, based on training videos with fMRI scans; the learned models are then applied to larger-scale test datasets without fMRI scans for category classification.
Our results show that: 1) there are meaningful couplings between the brain's fMRI responses and the video stimuli, suggesting the validity of linking semantics and low-level features via fMRI; 2) the low-level feature sets computationally learned from fMRI-derived semantic features can significantly improve the classification of video categories in comparison with classification based on the original low-level features.
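The paper does not give the details of the learning step, but the idea of selecting low-level features by their coupling with fMRI-derived semantic features can be sketched with a simple correlation-based ranking. This is a hypothetical illustration, not the authors' algorithm: `select_fmri_correlated_features`, the toy data, and the top-k selection rule are all assumptions made for the example.

```python
import numpy as np

def select_fmri_correlated_features(X_lowlevel, Y_fmri, k=10):
    """Rank each low-level feature by its maximum absolute Pearson
    correlation with any fMRI-derived semantic feature across the
    training clips, and return the indices of the top-k features.
    (Illustrative stand-in for the paper's unspecified learning step.)"""
    # Center and L2-normalize columns so X.T @ Y gives Pearson correlations.
    Xc = X_lowlevel - X_lowlevel.mean(axis=0)
    Yc = Y_fmri - Y_fmri.mean(axis=0)
    Xn = Xc / (np.linalg.norm(Xc, axis=0) + 1e-12)
    Yn = Yc / (np.linalg.norm(Yc, axis=0) + 1e-12)
    corr = Xn.T @ Yn                    # shape: (n_lowlevel, n_fmri)
    score = np.abs(corr).max(axis=1)    # strongest coupling per feature
    return np.argsort(score)[::-1][:k]

# Toy demo: 40 training clips, 20 low-level features, 4 fMRI features.
rng = np.random.default_rng(0)
Y = rng.standard_normal((40, 4))
X = rng.standard_normal((40, 20))
X[:, 3] = Y[:, 0] + 0.1 * rng.standard_normal(40)  # feature 3 tracks fMRI
selected = select_fmri_correlated_features(X, Y, k=5)
print(3 in selected)  # → True: the coupled feature is retained
```

The selected feature subset could then feed any standard classifier on the larger test set that lacks fMRI scans, which matches the two-stage train-with-fMRI / test-without-fMRI design the abstract describes.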
