MeT: A graph transformer for semantic segmentation of 3D meshes

Giuseppe Vecchio,Luca Prezzavento,Carmelo Pino,Francesco Rundo,Simone Palazzo,Concetto Spampinato

doi:10.1016/j.cviu.2023.103773

Giuseppe Vecchio, Luca Prezzavento + Show 4 more

Open Access

https://doi.org/10.1016/j.cviu.2023.103773

Copy DOI

Abstract

Polygonal meshes have become the standard for discretely approximating 3D shapes, thanks to their efficiency and high flexibility in capturing non-uniform shapes. This non-uniformity, however, leads to irregularity in the mesh structure, making tasks like segmentation of 3D meshes particularly challenging. Semantic segmentation of 3D mesh has been typically addressed through CNN-based approaches, leading to good accuracy. Recently, transformers have gained enough momentum both in NLP and computer vision fields, achieving performance at least on par with CNN models, supporting the long-sought architecture universalism. Following this trend, we propose a transformer-based method for semantic segmentation of 3D mesh motivated by a better modeling of the graph structure of meshes, by means of global attention mechanisms. In order to address the limitations of standard transformer architectures in modeling relative positions of non-sequential data, as in the case of 3D meshes, as well as in capturing the local context, we perform positional encoding by means the Laplacian eigenvectors of the adjacency matrix, replacing the traditional sinusoidal positional encodings, and by introducing clustering-based features into the self-attention and cross-attention operators. Experimental results, carried out on three sets of the Shape COSEG Dataset (Wang et al., 2012), on the human segmentation dataset proposed in Maron et al. (2017) and on the ShapeNet benchmark (Chang et al., 2015), show how the proposed approach yields state-of-the-art performance on semantic segmentation of 3D meshes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Vision and Image Understanding	Publication Date: Jul 13, 2023
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

MeT: A graph transformer for semantic segmentation of 3D meshes

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding

Lead the way for us

Similar Papers

3D Mesh Segmentation Based on Unsupervised Clustering
Dina Khattab ... Ashraf S Hussein
-
Dina Khattab, et. al.Dina Khattab ... Ashraf S Hussein
18 Oct 2016
18 Oct 2016

SCMS-Net: Self-Supervised Clustering-Based 3D Meshes Segmentation Network
Xue Jiao ... Xiaohui Yang
Computer-Aided Design | VOL. 160
Xue Jiao, et. al.Xue Jiao ... Xiaohui Yang
11 Mar 2023
Computer-Aided Design | VOL. 160

Deep learning-based semantic segmentation of urban-scale 3D meshes in remote sensing: A survey
Jibril Muhammad Adam ... Weiquan Liu
International Journal of Applied Earth Observation and Geoinformation | VOL. 121
Jibril Muhammad Adam, et. al.Jibril Muhammad Adam ... Weiquan Liu
01 Jul 2023
International Journal of Applied Earth Observation and Geoinformation | VOL. 121

Segmentation of 3D meshes combining the artificial neural network classifier and the spectral clustering
F Zakani ... T Gadi
Computer Optics | VOL. 42
F Zakani, et. al.F Zakani ... T Gadi
01 Jan 2018
Computer Optics | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MeT: A graph transformer for semantic segmentation of 3D meshes

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding