The 3D point cloud is a common 3D data representation that has received increasing attention for remote sensing applications. However, processing 3D point cloud semantics, especially local semantic information, has always been a challenge and has attracted much attention. In this paper, we propose a novel enhanced local semantic learning transformer for 3D point cloud analysis, which aims to enhance the transformer awareness of local semantic features to handle complex point cloud tasks. First, we propose a novel transformer framework, the local semantic learning point cloud transformer (LSLPCT), which not only learns 3D point clouds the global information, but also enhances the perception of local semantic information end-to-end. Second, we design an efficient local semantic learning self-attention mechanism, namely LSL-SA, which can parallelize the perception of global contextual information and the capture of finer-grained local semantic features. Third, our proposed LSL-SA is easy to implement and can integrate existing transformers and CNN-based networks for processing various point cloud tasks. Numerous experiments in different types of point cloud tasks have been conducted, and our method performs better or is competitive with other state-of-the-art methods.
Read full abstract