Spatio-temporal Matching for Human Detection in Video

Feng Zhou, Fernando De La Torre

doi:10.1007/978-3-319-10599-4_5

Abstract

Detecting and tracking humans in videos have been long-standing problems in computer vision. Most successful approaches (e.g., deformable parts models) rely heavily on discriminative models to build appearance detectors for body joints and on generative models to constrain possible body configurations (e.g., trees). While these 2D models have been applied successfully to images (and with less success to videos), a major challenge is to generalize them to cope with changes in camera view. To achieve view-invariance, these 2D models typically require a large amount of training data across views, which is difficult to gather and time-consuming to label. Unlike existing 2D models, this paper formulates the problem of human detection in videos as spatio-temporal matching (STM) between a 3D motion capture model and trajectories in videos. Our algorithm estimates the camera view and selects a subset of tracked trajectories that matches the motion of the 3D model. The STM is solved efficiently with linear programming, and it is robust to tracking mismatches, occlusions, and outliers. To the best of our knowledge, this is the first paper that solves the correspondence between video and 3D motion capture data for human pose detection. Experiments on the Human3.6M and Berkeley MHAD databases illustrate the benefits of our method over state-of-the-art approaches.
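
As a rough illustration of the matching idea described in the abstract, the following Python sketch projects a toy 3D model under a few candidate camera angles and picks the joint-to-trajectory assignment with the lowest total squared distance. It is not the authors' formulation: the abstract does not state the objective or constraints, so the orthographic projection, the squared-distance cost, the brute-force search (standing in for the linear program), and the function names `project`, `trajectory_cost`, and `match` are all assumptions made for this example.

```python
import itertools
import math


def project(points3d, yaw):
    """Rotate about the vertical (y) axis by `yaw` radians, then drop depth.

    Assumption: a simple orthographic camera, chosen only for illustration.
    """
    c, s = math.cos(yaw), math.sin(yaw)
    return [(x * c + z * s, y) for x, y, z in points3d]


def trajectory_cost(model2d, track2d):
    """Sum of squared distances between two equally long 2D point sequences."""
    return sum((mx - tx) ** 2 + (my - ty) ** 2
               for (mx, my), (tx, ty) in zip(model2d, track2d))


def match(model, tracks, yaws):
    """Return (cost, yaw, assignment); assignment[j] is the track index for joint j.

    model:  list of joints, each a list of (x, y, z) positions per frame
    tracks: list of candidate 2D trajectories, each a list of (x, y) per frame
    yaws:   candidate camera angles in radians
    Brute force over angles and assignments (hypothetical stand-in for the
    linear program mentioned in the abstract); feasible only for tiny inputs.
    """
    best = None
    for yaw in yaws:
        projected = [project(joint, yaw) for joint in model]
        cost = [[trajectory_cost(p, t) for t in tracks] for p in projected]
        # Each joint gets a distinct track; unused tracks are treated as outliers.
        for perm in itertools.permutations(range(len(tracks)), len(model)):
            total = sum(cost[j][k] for j, k in enumerate(perm))
            if best is None or total < best[0]:
                best = (total, yaw, perm)
    return best


# Toy data: two joints moving in depth, seen from the side, plus one outlier track.
model = [
    [(0, 0, 0), (0, 0, 1), (0, 0, 2)],
    [(1, 1, 0), (1, 1, 1), (1, 1, 2)],
]
tracks = [
    [(5, 5), (5, 5), (5, 5)],  # outlier trajectory
    [(0, 1), (1, 1), (2, 1)],  # side view of joint 1
    [(0, 0), (1, 0), (2, 0)],  # side view of joint 0
]
cost, yaw, assignment = match(model, tracks, [0.0, math.pi / 2])
print(yaw, assignment)
```

In this toy case the side view (yaw of π/2) explains the tracks almost exactly, and the outlier trajectory is left unassigned. A real system would replace the exhaustive search with an optimization that scales to many trajectories and frames, as the abstract indicates.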

Similar Papers

Spatio-Temporal Matching for Human Pose Estimation in Video
Feng Zhou ... Fernando De La Torre
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 38
04 Feb 2016

TCM: Temporal Consistency Model for Head Detection in Complex Videos
Sultan Daud Khan ... Habib Ullah
Journal of Sensors | VOL. 2020
16 Dec 2020

Vehicle Detection Using Mixture of Deformable Parts Models: Static and Dynamic Camera
Leissi Castaneda Leon ... Roberto Hirata Jr
01 Aug 2012

Adaptive Structural Model for Video Based Pedestrian Detection
Junjie Yan ... Zhen Lei
01 Jan 2015
