Abstract

In recent years, an increasing number of videos have been captured from the first-person viewpoint by wearable cameras. Such first-person video provides information beyond that of traditional third-person video and thus has a wide range of applications. However, techniques for analyzing first-person video can be fundamentally different from those for third-person video, and exploiting the information shared across both viewpoints is even more difficult. In this paper, we propose a novel method for first- and third-person video co-analysis. At the core of our method is the notion of "joint attention", a learnable representation that corresponds to the attention regions shared across the two viewpoints and thereby links them. To this end, we develop a multi-branch deep network with a triplet loss to extract the joint attention from first- and third-person videos via self-supervised learning. We evaluate our method on a public dataset with cross-viewpoint video matching tasks, where it outperforms the state of the art both qualitatively and quantitatively. We also demonstrate how the learned joint attention can benefit various applications through a set of additional experiments.
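The abstract mentions a multi-branch network trained with a triplet loss to embed first- and third-person videos into a shared space. The following is a minimal sketch of that idea, not the authors' implementation: the encoder architecture, feature dimensions, and variable names are assumptions made purely for illustration, with a first-person clip as the anchor, its paired third-person clip as the positive, and an unpaired third-person clip as the negative.

```python
# Hedged sketch (not the paper's code): triplet loss over per-viewpoint
# branches that map clip features into a shared "joint attention" space.
import torch
import torch.nn as nn


class ViewBranch(nn.Module):
    """Toy per-viewpoint encoder; real branches would be video CNNs."""

    def __init__(self, in_dim: int = 2048, embed_dim: int = 256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 512),
            nn.ReLU(),
            nn.Linear(512, embed_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Unit-length embeddings so distances are comparable across branches.
        return nn.functional.normalize(self.net(x), dim=-1)


first_branch, third_branch = ViewBranch(), ViewBranch()
triplet = nn.TripletMarginLoss(margin=0.3)

# Random tensors stand in for pooled clip features of a mini-batch of 8.
anchor = first_branch(torch.randn(8, 2048))    # first-person clips
positive = third_branch(torch.randn(8, 2048))  # matching third-person clips
negative = third_branch(torch.randn(8, 2048))  # non-matching third-person clips

loss = triplet(anchor, positive, negative)
loss.backward()
print(f"triplet loss: {loss.item():.4f}")
```

The loss pulls paired first-/third-person clips together and pushes unpaired ones apart, which is one common way such cross-view matching objectives are set up; the paper's actual self-supervised training may differ in its sampling and architecture.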
