Multi-Head Attention Refiner for Multi-View 3D Reconstruction.

Kyunghee Lee,Ihjoon Cho,Boseung Yang,Unsang Park

doi:10.3390/jimaging10110268

Abstract

Traditional 3D reconstruction models have consistently faced the challenge of balancing high recall of object edges with maintaining a high precision. In this paper, we introduce a post-processing method, the Multi-Head Attention Refiner (MA-R), designed to address this issue by integrating a multi-head attention mechanism into the U-Net style refiner module. Our method demonstrates improved capability in capturing intricate image details, leading to significant enhancements in boundary predictions and recall rates. In our experiments, the proposed approach notably improves the reconstruction performance of Pix2Vox++ when multiple images are used as the input. Specifically, with 20-view images, our method achieves an IoU score of 0.730, a 1.1% improvement over the 0.719 of Pix2Vox++, and a 2.1% improvement in F-Score, achieving 0.483 compared to 0.462 of Pix2Vox++. These results underscore the robustness of our approach in enhancing both precision and recall in 3D reconstruction tasks involving multiple views.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Head Attention Refiner for Multi-View 3D Reconstruction.

Abstract

Talk to us

Similar Papers

More From: Journal of imaging

Lead the way for us

Journal: Journal of imaging	Publication Date: Oct 24, 2024
License type: cc-by

Similar Papers

Research on 3D reconstruction of human face based on single image
Chaoying Zhang ... Baolin Liang
-
Chaoying Zhang, et. al.Chaoying Zhang ... Baolin Liang
15 Oct 2021
15 Oct 2021

Learning stratified 3D reconstruction
Qiulei Dong ... Mao Shu
Science China Information Sciences | VOL. 61
Qiulei Dong, et. al.Qiulei Dong ... Mao Shu
26 Dec 2017
Science China Information Sciences | VOL. 61

Multi-dimensional fusion: transformer and GANs-based multimodal audiovisual perception robot for musical performance art.
Shiyi Lu ... Panpan Wang
Frontiers in neurorobotics | VOL. 17
Shiyi Lu, et. al.Shiyi Lu ... Panpan Wang
29 Sep 2023
Frontiers in neurorobotics | VOL. 17

Chemical-protein interaction extraction via contextualized word representations and multihead attention.
Yijia Zhang ... Jian Wang
Database | VOL. 2019
Yijia Zhang, et. al.Yijia Zhang ... Jian Wang
01 Jan 2019
Database | VOL. 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Head Attention Refiner for Multi-View 3D Reconstruction.

Abstract

Talk to us

Similar Papers

More From: Journal of imaging