3D Object Classification Using a Volumetric Deep Neural Network: An Efficient Octree Guided Auxiliary Learning Approach

A A M Muzahid,Ofelia Delfina Cervantes Villagomez,Hidayat Ullah,Ferdous Sohel,Naimat Ullah Khan,Wanggen Wan

doi:10.1109/access.2020.2968506

A A M Muzahid, Ofelia Delfina Cervantes Villagomez + Show 4 more

Open Access

https://doi.org/10.1109/access.2020.2968506

Copy DOI

Abstract

We consider the recent challenges of 3D shape analysis based on a volumetric CNN that requires a huge computational power. This high-cost approach forces to reduce the volume resolutions when applying 3D CNN on volumetric data. In this context, we propose a multiorientation volumetric deep neural network (MV-DNN) for 3D object classification with octree generating low-cost volumetric features. In comparison to conventional octree representations, we propose to limit the octree partition to a certain depth to reserve all leaf octants with sparsity features. This allows for improved learning of complex 3D features and increased prediction of object labels at both low and high resolutions. Our auxiliary learning approach predicts object classes based on the subvolume parts of a 3D object that improve the classification accuracy compared to other existing 3D volumetric CNN methods. In addition, the influence of views and depths of the 3D model on the classification performance is investigated through extensive experiments applied to the ModelNet40 database. Our deep learning framework runs significantly faster and consumes less memory than full voxel representations and demonstrate the effectiveness of our octree-based auxiliary learning approach for exploring high resolution 3D models. Experimental results reveal the superiority of our MV-DNN that achieves better classification accuracy compared to state-of-art methods on two public databases.

Highlights

The rapid development of consumer depth cameras, 3D acquisition and scanning devices make it easier to obtain a 3D view of a real object that is tremendously increasing many real world applications [1] within the areas of computer vision, online gaming, films, TV, engineering project modeling, biology, military research and many more applications in the field of visual reality
Object classification is a key application of computer vision areas, whereas vision systems are built using the theory of artificial intelligence (AI) systems that machines can recognize what is perceived similar to the human visual system
Considering the deep neural network and computational limitations of voxel representations, this paper proposes an VOLUME 8, 2020 effective octree-based auxiliary learning approach for 3D object classification based on a multiorientation volumetric deep neural network (MV-DNN)

Summary

INTRODUCTION

The rapid development of consumer depth cameras, 3D acquisition and scanning devices make it easier to obtain a 3D view of a real object that is tremendously increasing many real world applications [1] within the areas of computer vision, online gaming, films, TV, engineering project modeling, biology, military research and many more applications in the field of visual reality. Considering the deep neural network and computational limitations of voxel representations, this paper proposes an VOLUME 8, 2020 effective octree-based auxiliary learning approach for 3D object classification based on a multiorientation volumetric deep neural network (MV-DNN). We propose to preserve all octants information to a certain partition level based on the predefined input voxel resolution to store high-precision contour features at the beginning of the octree partition This effective feature helps to enhance the geometric resolution to the convolution filter and improve the classifier performance compared with conventional octree representation. Our proposed MV-DNN is a GPU-based volumetric deep convolutional neural network that directly inputs octree structures of a 3D object.

RELATED WORK

CLASSIFICATION EXPERIMENTS

TRAINING DETAILS

Findings

CONCLUSIONS AND FUTURE WORKS