Color-depth Camera Research Articles

Human activity recognition has a variety of important real-world applications, such as video analysis, surveillance, and human-robot interaction. As a promising video representation method, local spatio-temporal (LST) features have received increasing attention from computer vision, machine learning, and robotics communities. However, approaches based on traditional LST features only use color information, which face several challenges, such as illumination changes and dynamic backgrounds. The recent availability of commercial color-depth cameras makes it much cheaper, faster, and easier to acquire depth information, which provides a potential to implement more discriminative and robust LST features. In this paper, we introduce the new 4-D color-depth (CoDe4D) LST feature that incorporates both intensity and depth information acquired from RGB-D cameras. Our feature detector constructs a saliency map through applying independent filters in $\boldsymbol {xyzt}$ dimension to represent texture, shape and pose variations, and selects its local maxima as interest points. Our multichannel orientation histogram descriptor applies a 4-D support region, which is adaptive to linear perspective view changes, on each interest point. Then, image gradients of color-depth patches within the support region are computed and quantized using a spherical coordinate-based method to form a final feature vector. We build a complete activity recognition system by combining our features with bag-of-features representations and support vector machines. To evaluate the performance of our CoDe4D LST features and the complete system, we conduct experiments using four benchmark color-depth human activity data sets, including UTK Action3-D, Berkeley MHAD, ACT $\boldsymbol {4^{2^{^{^{}}}}}$ , and MSR daily activity 3-D data sets. Experimental results demonstrate the promising representative power of our CoDe4D features, which obtain the state-of-the-art performance on activity recognition from RGB-D visual data.

Read full abstract

The ability to perceive humans is an essential requirement for safe and efficient human-robot interaction. In real-world applications, the need for a robot to interact in real time with multiple humans in a dynamic, 3-D environment presents a significant challenge. The recent availability of commercial color-depth cameras allow for the creation of a system that makes use of the depth dimension, thus enabling a robot to observe its environment and perceive in the 3-D space. Here we present a system for 3-D multiple human perception in real time from a moving robot equipped with a color-depth camera and a consumer-grade computer. Our approach reduces computation time to achieve real-time performance through a unique combination of new ideas and established techniques. We remove the ground and ceiling planes from the 3-D point cloud input to separate candidate point clusters. We introduce the novel information concept, depth of interest, which we use to identify candidates for detection, and that avoids the computationally expensive scanning-window methods of other approaches. We utilize a cascade of detectors to distinguish humans from objects, in which we make intelligent reuse of intermediary features in successive detectors to improve computation. Because of the high computational cost of some methods, we represent our candidate tracking algorithm with a decision directed acyclic graph, which allows us to use the most computationally intense techniques only where necessary. We detail the successful implementation of our novel approach on a mobile robot and examine its performance in scenarios with real-world challenges, including occlusion, robot motion, nonupright humans, humans leaving and reentering the field of view (i.e., the reidentification challenge), human-object and human-human interaction. We conclude with the observation that the incorporation of the depth information, together with the use of modern techniques in new ways, we are able to create an accurate system for real-time 3-D perception of humans by a mobile robot.

Read full abstract

Color-depth Camera Research Articles

Related Topics

Articles published on Color-depth Camera

Latent Space Representations for Marker-Less Realtime Hand-Eye Calibration.

Estimation of human body 3D pose for parent-infant interaction settings using azure Kinect and OpenPose

Collision-Free Navigation in Human-Following Task Using a Cognitive Robotic System on Differential Drive Vehicles

Approach for accurate calibration of RGB-D cameras using spheres.

A Comparative Study of Markerless Systems Based on Color-Depth Cameras, Polymer Optical Fiber Curvature Sensors, and Inertial Measurement Units: Towards Increasing the Accuracy in Joint Angle Estimation

Robust Intrinsic and Extrinsic Calibration of RGB-D Cameras

Application of Motion Capture Attributes to Individual Identification under Corridor Surveillance

CoDe4D: Color-Depth Local Spatio-Temporal Features for Human Activity Recognition From RGB-D Videos

An ultra-fast human detection method for color-depth camera

Real-Time Multiple Human Perception With Color-Depth Cameras on a Mobile Robot

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Color-depth Camera Research Articles

Related Topics

Articles published on Color-depth Camera

Latent Space Representations for Marker-Less Realtime Hand-Eye Calibration.

Estimation of human body 3D pose for parent-infant interaction settings using azure Kinect and OpenPose

Collision-Free Navigation in Human-Following Task Using a Cognitive Robotic System on Differential Drive Vehicles

Approach for accurate calibration of RGB-D cameras using spheres.

A Comparative Study of Markerless Systems Based on Color-Depth Cameras, Polymer Optical Fiber Curvature Sensors, and Inertial Measurement Units: Towards Increasing the Accuracy in Joint Angle Estimation

Robust Intrinsic and Extrinsic Calibration of RGB-D Cameras

Application of Motion Capture Attributes to Individual Identification under Corridor Surveillance

CoDe4D: Color-Depth Local Spatio-Temporal Features for Human Activity Recognition From RGB-D Videos

An ultra-fast human detection method for color-depth camera

Real-Time Multiple Human Perception With Color-Depth Cameras on a Mobile Robot