Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †.

Álvaro García-Martín,José Martínez,Juan Sanmiguel

doi:10.3390/s19010004

Álvaro García-Martín, José Martínez + Show 1 more

Open Access

https://doi.org/10.3390/s19010004

Copy DOI

Journal: Sensors (Basel, Switzerland)	Publication Date: Dec 20, 2018
Citations: 2	License type: CC BY 4.0

Affiliation: Autonomous University of Madrid

Abstract

Applying people detectors to unseen data is challenging since patterns distributions, such as viewpoints, motion, poses, backgrounds, occlusions and people sizes, may significantly differ from the ones of the training dataset. In this paper, we propose a coarse-to-fine framework to adapt frame by frame people detectors during runtime classification, without requiring any additional manually labeled ground truth apart from the offline training of the detection model. Such adaptation make use of multiple detectors mutual information, i.e., similarities and dissimilarities of detectors estimated and agreed by pair-wise correlating their outputs. Globally, the proposed adaptation discriminates between relevant instants in a video sequence, i.e., identifies the representative frames for an adaptation of the system. Locally, the proposed adaptation identifies the best configuration (i.e., detection threshold) of each detector under analysis, maximizing the mutual information to obtain the detection threshold of each detector. The proposed coarse-to-fine approach does not require training the detectors for each new scenario and uses standard people detector outputs, i.e., bounding boxes. The experimental results demonstrate that the proposed approach outperforms state-of-the-art detectors whose optimal threshold configurations are previously determined and fixed from offline training data.

Highlights

Automatic people detection in video sequences is one of the most relevant problems in computer vision, which is essential in many applications such as for video-surveillance, human–computer interaction and mobile robotics
We classify the frame based on the evidence provided by the entropy E, we evaluate the posterior probability of each class P(qi | E ) and we choose the class with largest P(qi | E ), i.e., ω1
We proposed the estimation of the absence/presence of people for each frame, using the entropy of the correlation map Cn,m

Summary

Introduction

Automatic people detection in video sequences is one of the most relevant problems in computer vision, which is essential in many applications such as for video-surveillance, human–computer interaction and mobile robotics. Our proposal explores multiple thresholding hypotheses for all employed detectors and exploits pair-wise correlations between their outputs within a coarse-to-fine adaptation strategy. A fine adaptation stage is performed for frames where people are present by optimally selecting the detection threshold for each detector. Such selection is performed for each detector by accumulating all pair-wise comparisons with other detectors. It can be applied to many recent approaches, as demonstrated by the experimental results, which show that adapting sets of people detectors (from two to six) outperforms individual detectors tuned to obtain maximum performance (i.e., whose threshold is trained offline and fixed in advance).

State of the Art

Detector Adaptation Framework

Cross-Correlation of Detectors

Pair-Wise Correlation

Coarse Adaptation

Fine Adaptation

Experimental Results

Coarse Adaptation Results

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

Adaptive people detection based on cross-correlation maximization
Alvaro Garcia-Martin ... Juan C Sanmiguel
-
Alvaro Garcia-Martin, et. al.Alvaro Garcia-Martin ... Juan C Sanmiguel
01 Sep 2017
01 Sep 2017

Shot detection in video sequences using entropy based metrics
Z Cernekova ... I Pitas
-
Z Cernekova, et. al.Z Cernekova ... I Pitas
24 Jun 2002
24 Jun 2002

A rank minimization approach to fast dynamic event detection and track matching in video sequences
Tao Ding ... Mario Sznaier
-
Tao Ding, et. al. Tao Ding ... Mario Sznaier
01 Jan 2007
01 Jan 2007

Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection
Yazhou Liu ... Matti Pietikainen
-
Yazhou Liu, et. al.Yazhou Liu ... Matti Pietikainen
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)