Realtime Global Attention Network for Semantic Segmentation

Xi Mo,Xiangyu Chen

doi:10.1109/lra.2022.3140443

Abstract

In this letter, we proposed an end-to-end realtime global attention neural network (RGANet) for the challenging task of semantic segmentation. Different from the encoding strategy deployed by self-attention paradigms, the proposed global attention module encodes global attention via depthwise convolution and affine transformations. The integration of these global attention modules into a hierarchical architecture maintains high inferential performance. In addition, an improved evaluation metric, namely MGRID, is proposed to alleviate the negative effect of non-convex, widely scattered ground-truth areas. Results from extensive experiments on state-of-the-art architectures for suction region segmentation manifest the leading performance of proposed approaches for robotic monocular visual perception.

Full Text