Abstract

Topic detection based on text reasoning has attracted widespread attention. Existing methods rely on inference over textual semantic cues. However, each web video is described with only a few words, so the textual reasoning cues are sparse. In this situation, it is difficult to identify videos that belong to the same topic, which makes topic detection for web videos challenging. Fortunately, visual information contains far more detailed cues than textual information, such as colors, scenes, and objects, and cross-media joint reasoning therefore provides richer, complementary reasoning cues than text alone. In view of this, this paper extends topic detection from text reasoning to cross-media reasoning. A novel heterogeneous interactive tensor learning (HITL) method is proposed, which detects topics through cross-media joint inference. After local features of keyframes and textual information are extracted, the semantic correlation between the visual and textual modalities is mined by constructing a keyframe-text interaction attention matrix. A joint cue between textual and visual information is then built in a cross-media heterogeneous interaction tensor space, thereby enriching the sparse textual cues through cross-media fusion. Finally, semantic features are extracted through cue interaction in the tensor space for topic detection.

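The abstract outlines a pipeline of keyframe-text interaction attention followed by tensor-space fusion. Below is a minimal sketch of that idea, assuming projected keyframe and word features, a bilinear interaction score, and an outer-product fusion; the layer names, dimensions, and classifier are illustrative assumptions, not the authors' exact HITL architecture.

# Hypothetical sketch: keyframe-text interaction attention and a simple
# cross-media tensor fusion, in the spirit of the abstract. All module
# names and sizes are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossMediaFusionSketch(nn.Module):
    def __init__(self, vis_dim=512, txt_dim=300, hid_dim=128, num_topics=20):
        super().__init__()
        # Project both modalities into a shared space (assumed design).
        self.vis_proj = nn.Linear(vis_dim, hid_dim)
        self.txt_proj = nn.Linear(txt_dim, hid_dim)
        # Bilinear weight used to score keyframe-word pairs.
        self.interaction = nn.Parameter(torch.randn(hid_dim, hid_dim) * 0.01)
        # Topic classifier over the flattened fused tensor.
        self.classifier = nn.Linear(hid_dim * hid_dim, num_topics)

    def forward(self, keyframes, words):
        # keyframes: (B, m, vis_dim) local keyframe features
        # words:     (B, n, txt_dim) word embeddings of the description
        V = self.vis_proj(keyframes)           # (B, m, h)
        T = self.txt_proj(words)               # (B, n, h)

        # Keyframe-text interaction attention matrix: A[b, i, j] scores the
        # semantic correlation between keyframe i and word j.
        scores = torch.einsum("bih,hk,bjk->bij", V, self.interaction, T)
        A = F.softmax(scores.flatten(1), dim=-1).view_as(scores)

        # Attention-weighted visual and textual cues.
        v = torch.einsum("bij,bih->bh", A, V)  # visual cue   (B, h)
        t = torch.einsum("bij,bjh->bh", A, T)  # textual cue  (B, h)

        # Outer product as a stand-in for the heterogeneous interaction
        # tensor space, fusing the two cues before classification.
        fused = torch.einsum("bh,bk->bhk", v, t).flatten(1)  # (B, h*h)
        return self.classifier(fused)          # topic logits

# Minimal usage example with random inputs.
model = CrossMediaFusionSketch()
logits = model(torch.randn(2, 8, 512), torch.randn(2, 12, 300))
print(logits.shape)  # torch.Size([2, 20])

The outer-product fusion is only one plausible realization of a "cross-media heterogeneous interaction tensor"; the paper's full text would determine the actual construction.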