Single RGB-D Image Research Articles

3D symmetry detection is a fundamental problem in computer vision and graphics. Most prior works detect symmetry when the object model is fully known, few studies symmetry detection on objects with partial observation, such as single RGB-D images. Recent work addresses the problem of detecting symmetries from incomplete data with a deep neural network by leveraging the dense and accurate symmetry annotations. However, due to the tedious labeling process, full symmetry annotations are not always practically available. In this work, we present a 3D symmetry detection approach to detect symmetry from single-view RGB-D images without using symmetry supervision. The key idea is to train the network in a weakly-supervised learning manner to complete the shape based on the predicted symmetry such that the completed shape be similar to existing plausible shapes. To achieve this, we first propose a discriminative variational autoencoder to learn the shape prior in order to determine whether a 3D shape is plausible or not. Based on the learned shape prior, a symmetry detection network is present to predict symmetries that produce shapes with high shape plausibility when completed based on those symmetries. Moreover, to facilitate end-to-end network training and multiple symmetry detection, we introduce a new symmetry parametrization for the learning-based symmetry estimation of both reflectional and rotational symmetry. The proposed approach, coupled symmetry detection with shape completion, essentially learns the symmetry-aware shape prior, facilitating more accurate and robust symmetry detection. Experiments demonstrate that the proposed method is capable of detecting reflectional and rotational symmetries accurately, and shows good generality in challenging scenarios, such as objects with heavy occlusion and scanning noise. Moreover, it achieves state-of-the-art performance, improving the F1-score over the existing supervised learning method by 2%-11% on the ShapeNet and ScanNet datasets.

Robotic grasp in complex open-world scenarios requires an effective and generalizable perception. Estimating object’s pose is needed in a variety of practical grasping scenarios. Here we present a novel approach of pose estimation of textureless and textured objects. The algorithm utilizes a single RGB-D image to exploit depth invariant, oriented point pair feature as well as local contextual sensitivity in cluttered environments. To enhance the performance of the voting process and improve learning efficiency, we employ a global pruning algorithm that reduces the risk of overfitting and simplifies the structure of decision trees after compensating for the complementary information among multiple trees by optimizing a designed global objective function. Finally, we also refine the pose obtained from the above stage. The proposed approach of estimating 6-D (degree of freedom) poses of textured and textureless objects is evaluated on publicly available data sets against the recent works under various conditions. It illustrates that our framework is superior to these recent works. Further, we perform extensive qualitative experiments of robotic grasp to illustrate the proposed approach can be applied to practical scenarios. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Note to Practitioners</i> —This article is motivated by the problem of the pose estimation of textured and textureless objects in clutter environments. It is difficult for conventional works to address the issue of estimating textured or textureless objects’ poses in such scenarios. We considered that a novel system should be able to obtain the 6-D poses of objects. Therefore, we investigate the combined use of multiple split functions with different characteristics. Learning the model based on Hough forests always cost much computational resource; therefore, we construct a novel pruned Hough forest for solving this issue. Through the comparison and robotic grasp verifications, the behavior of our system can be used in practical applications. In future, we will deploy the proposed system in robotic assembling tasks.

Single RGB-D Image Research Articles

Related Topics

Articles published on Single RGB-D Image

MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image

Head model reconstruction and animation method using color image with depth information

Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image.

Neural-Based Detection and Segmentation of Articulated Objects for Robotic Interaction in a Household Environment

Learning to Detect 3D Symmetry From Single-View RGB-D Images With Weak Supervision.

Robust Symmetry Prediction with Multi-Modal Feature Fusion for Partial Shapes

Multiple geometry representations for 6D object pose estimation in occluded or truncated scenes

PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

A dynamic keypoint selection network for 6DoF pose estimation

A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene

DDGC: Generative Deep Dexterous Grasping in Clutter

Real-Time Block-Based Embedded CNN for Gesture Classification on an FPGA

Object Pose Estimation via Pruned Hough Forest With Combined Split Schemes for Robotic Grasp

Sparse intrinsic decomposition and applications

Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset

L6DNet: Light 6 DoF Network for Robust and Precise Object Pose Estimation With Small Datasets

Graph neural network for 6D object pose estimation

The design of tourism product CAD three-dimensional modeling system using VR technology.

6DoF Pose Estimation of Transparent Object from a Single RGB-D Image.

Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Single RGB-D Image Research Articles

Related Topics

Articles published on Single RGB-D Image

MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image

Head model reconstruction and animation method using color image with depth information

Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image.

Neural-Based Detection and Segmentation of Articulated Objects for Robotic Interaction in a Household Environment

Learning to Detect 3D Symmetry From Single-View RGB-D Images With Weak Supervision.

Robust Symmetry Prediction with Multi-Modal Feature Fusion for Partial Shapes

Multiple geometry representations for 6D object pose estimation in occluded or truncated scenes

PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

A dynamic keypoint selection network for 6DoF pose estimation

A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene

DDGC: Generative Deep Dexterous Grasping in Clutter

Real-Time Block-Based Embedded CNN for Gesture Classification on an FPGA

Object Pose Estimation via Pruned Hough Forest With Combined Split Schemes for Robotic Grasp

Sparse intrinsic decomposition and applications

Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset

L6DNet: Light 6 DoF Network for Robust and Precise Object Pose Estimation With Small Datasets

Graph neural network for 6D object pose estimation

The design of tourism product CAD three-dimensional modeling system using VR technology.

6DoF Pose Estimation of Transparent Object from a Single RGB-D Image.

Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments