Abstract

Reconstructing a three-dimensional (3D) shape from a single image is one of the main topics in the field of computer vision. Some of the methods for 3D reconstruction adopt machine learning. These methods use machine learning for acquiring the relationship between 3D shape and 2D image, and reconstruct 3D shapes by using the learned relationship. However, since only predefined features (pixels in the image) are used, it is not possible to obtain the desired features of the 2D image for 3D reconstruction. Therefore, this paper presents a method for reconstructing 3D shapes by learning features of 2D images using deep learning. This method uses Convolutional Neural Network (CNN) for feature learning to reconstruct a 3D shape. Pooling layers and convolutional layers of the CNN capture spatial information about images and automatically select valuable image features. This paper presents two types of the reconstruction methods. The first one is to first estimate the normal vector of the object, and then reconstruct the 3D shape from the normal vector by deep learning. The second one is direct reconstruction of the 3D shape from an image by a deep neural network. The experimental results using human face images showed that the proposed method can reconstruct 3D shapes with higher accuracy than the previous methods.

Highlights

  • Reconstructing three-dimensional (3D) shapes of objects from two-dimensional (2D) images is one of the most attractive areas of computer vision

  • This paper presents a method for 3D reconstruction from a single image by deep learning. This method adopts Convolutional Neural Network (CNN) [11, 12, 13] for feature learning because it is successful in various fields such as object recognition and semantic segmentation

  • The first one is first estimating the normal vector of the object, reconstructing the 3D shape from the normal vector by deep learning

Read more

Summary

INTRODUCTION

Reconstructing three-dimensional (3D) shapes of objects from two-dimensional (2D) images is one of the most attractive areas of computer vision. This method could reconstruct 3D shapes more accurately than the previous methods This method uses predefined feature of the image, the pixel value of the patch. This paper presents a method for 3D reconstruction from a single image by deep learning This method adopts Convolutional Neural Network (CNN) [11, 12, 13] for feature learning because it is successful in various fields such as object recognition and semantic segmentation. This method outputs the 3D coordinates of an object from the pixel values of 2D image.

Previous Works for 3D Reconstruction
Direct 3D Reconstruction Method
Experimental Setting
Experimental Results of Reconstructed Normal Vector
Experimental Results of Reconstructed 3D Shape
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call