Point cloud single-view reconstruction (PC-SVR) generates high-quality point clouds from low-cost 2D images, offering a cost-effective solution to the expensive and inefficient acquisition of point cloud for real-world scene typically achieved through expensive LiDAR systems. However, current models that generate point clouds from real-world 2D images (e.g. maritime vessels) still have shortcomings in terms of quality and model generalization. In this paper, we proposed a image-conditioned denoising diffusion probabilistic model (ICDDPM) for real-world complex point cloud single view reconstruction to address these issues. We re-designed the structure of classic diffusion model, use latent shape vectors to seamlessly integrate 2D image encoder, point cloud encoder, and conditional diffusion model, to cater PC-SVR task. By guiding the diffusion process with the 2D images, which serve as crucial conditional information, ICDDPM achieves end-to-end point cloud generation with superior quality. 2D images are also employed as input in the reverse diffusion process to further achieve point cloud generation. We conducted qualitative and quantitative experiments on synthetic dataset ShapeNet and real-world dataset PASCAL3D+ (focused on experiments of vessel point cloud data specifically). The results indicate that the ICDDPM model demonstrates superior performance compared to state-of-the-art models. It is capable of generating point clouds with a greater level of global and local details from various 2D image data. Additionally, the model exhibits strong generalization abilities and requires fewer computational resources.
Read full abstract