FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Zihui Guo,Wanqing Li,Yonghong Hou,Pichao Wang,Mingliang Xu,Zhimin Gao

doi:10.1007/s00521-022-07826-w

Abstract

Analysis of human interaction is one important research topic of human motion analysis. It has been studied either using first-person vision (FPV) or third-person vision (TPV). However, the joint learning of both types of vision has so far attracted little attention. One of the reasons is the lack of suitable datasets that cover both FPV and TPV. In addition, existing benchmark datasets of either FPV or TPV have several limitations, including the limited number of samples, participant subjects, interaction categories, and modalities. In this work, we contribute a large-scale human interaction dataset, namely FT-HID dataset. FT-HID contains pair-aligned samples of first-person and third-person visions. The dataset was collected from 109 distinct subjects and has more than 90K samples for three modalities. The dataset has been validated by using several existing action recognition methods. In addition, we introduce a novel multi-view interaction mechanism for skeleton sequences, and a joint learning multi-stream framework for first-person and third-person visions. Both methods yield promising results on the FT-HID dataset. It is expected that the introduction of this vision-aligned large-scale dataset will promote the development of both FPV and TPV, and their joint learning techniques for human action analysis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications

Lead the way for us

Journal: Neural Computing and Applications	Publication Date: Oct 7, 2022
Citations: 3

Similar Papers

An Introduction to the 3rd Workshop on Egocentric (First-Person) Vision
Steve Mann ... Kris M Kitani
-
Steve Mann, et. al.Steve Mann ... Kris M Kitani
01 Jun 2014
01 Jun 2014

A Pointing Gesture Based Egocentric Interaction System: Dataset, Approach and Application
Yichao Huang ... Lianwen Jin
-
Yichao Huang, et. al.Yichao Huang ... Lianwen Jin
01 Jun 2016
01 Jun 2016

RGB-D sensing based human action and interaction analysis: A survey
Bangli Liu ... Honghai Liu
Pattern Recognition | VOL. 94
Bangli Liu, et. al.Bangli Liu ... Honghai Liu
11 May 2019
Pattern Recognition | VOL. 94

Predicting the future from first person (egocentric) vision: A survey
Ivan Rodin ... Giovanni Maria Farinella
Computer Vision and Image Understanding | VOL. 211
Ivan Rodin, et. al.Ivan Rodin ... Giovanni Maria Farinella
04 Aug 2021
Computer Vision and Image Understanding | VOL. 211

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications