Abstract
The rapid growth of online video platforms has created an urgent need for social interaction recognition techniques. Compared with simple short-term actions, long-term social interactions in semantic-rich videos reflect more complex semantics, such as character relationships and emotions, which better support downstream applications, e.g., story summarization and fine-grained clip retrieval. However, given the longer duration of social interactions, their severe mutual overlap, and the involvement of multiple characters, dynamic scenes, and multi-modal cues, traditional solutions for short-term action recognition are likely to fail in this task. To address these challenges, in this article we propose a hierarchical graph-based system, named InteractNet, that recognizes social interactions from a multi-modal perspective. Specifically, our approach first generates a semantic graph for each sampled frame by integrating multi-modal cues, and then learns node representations as short-term interaction patterns via an adapted GCN module. Along this line, global interaction representations are accumulated through a sub-clip identification module, which effectively filters out irrelevant information and resolves temporal overlaps between interactions. Finally, the associations among simultaneous interactions are captured and modelled by constructing a global-level character-pair graph to predict the final social interactions. Comprehensive experiments on publicly available datasets demonstrate the effectiveness of our approach compared with state-of-the-art baseline methods.
Published in: ACM Transactions on Multimedia Computing, Communications, and Applications