Adapting RGB Pose Estimation to New Domains

Gururaj Mulay,Bruce A Draper,J Ross Beveridge

doi:10.1109/ccwc.2019.8666594

Abstract

Many multi-modal human computer interaction (HCI) systems interact with users in real-time by estimating the user’s pose. Generally, they estimate human poses using depth sensors such as the Microsoft Kinect. For multi-modal HCI interfaces to gain traction in the real world, however, it would be better for pose estimation to be based on data from RGB cameras, which are more common and less expensive than depth sensors. This has motivated research into pose estimation from RGB images. Convolutional Neural Networks (CNNs) represent the state-of-the-art in this literature, for example [1], [2], [9], [13], [14], and [15]. These systems estimate 2D human poses from RGB images. A problem with current CNN-based pose estimators is that they require large amounts of labeled data for training. If the goal is to train an RGB pose estimator for a new domain, the cost of collecting and more importantly labeling data can be prohibitive. A common solution is to train on publicly available pose data sets, but then the trained system is not tailored to the domain. We propose using RGB+D sensors to collect domain-specific data in the lab, and then training the RGB pose estimator using skeletons automatically extracted from the RGB+D data. This paper presents a case study of adapting the RMPE pose estimation network [2] to the domain of the DARPA Communicating with Computers (CWC) program [3], as represented by the EGGNOG data set [8]. We chose RMPE because it predicts both joint locations and Part Affinity Fields (PAFs) in real-time. Our adaptation of RMPE trained on automatically-labeled data outperforms the original RMPE on the EGGNOG data set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adapting RGB Pose Estimation to New Domains

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
Penglei Liu ... Qieshi Zhang
IEEE Transactions on Automation Science and Engineering | VOL. 21
Penglei Liu, et. al.Penglei Liu ... Qieshi Zhang
01 Apr 2024
IEEE Transactions on Automation Science and Engineering | VOL. 21

Simultaneously-Collected Multimodal Lying Pose Dataset: Enabling In-Bed Human Pose Monitoring.
Shuangjun Liu ... Xiaofei Huang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Shuangjun Liu, et. al.Shuangjun Liu ... Xiaofei Huang
01 Jan 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Robust 2D human upper-body pose estimation with fully convolutional network
...
-
, et. al. ...
01 Jun 2018
01 Jun 2018

Enhanced RGB-D Mapping Method for Detailed 3D Indoor and Outdoor Modeling
Shengjun Tang ... Bo Wu
Sensors | VOL. 16
Shengjun Tang, et. al.Shengjun Tang ... Bo Wu
27 Sep 2016
Sensors | VOL. 16

Publication Date: Jan 1, 2019
Citations: 10	License type: mit

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adapting RGB Pose Estimation to New Domains

Abstract

Talk to us

Similar Papers