AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

Qi Song,Kangfu Mei,Rui Huang

doi:10.1609/aaai.v35i3.16359

Abstract

Two factors have proven to be very important to the performance of semantic segmentation models: global context and multi-level semantics. However, generating features that capture both factors always leads to high computational complexity, which is problematic in real-time scenarios. In this paper, we propose a new model, called Attention-Augmented Network (AttaNet), to capture both global context and multi-level semantics while keeping the efficiency high. AttaNet consists of two primary modules: Strip Attention Module (SAM) and Attention Fusion Module (AFM). Viewing that in challenging images with low segmentation accuracy, there are a significantly larger amount of vertical strip areas than horizontal ones, SAM utilizes a striping operation to reduce the complexity of encoding global context in the vertical direction drastically while keeping most of contextual information, compared to the non-local approaches. Moreover, AFM follows a cross-level aggregation strategy to limit the computation, and adopts an attention strategy to weight the importance of different levels of features at each pixel when fusing them, obtaining an efficient multi-level representation. We have conducted extensive experiments on two semantic segmentation benchmarks, and our network achieves different levels of speed/accuracy trade-offs on Cityscapes, e.g., 71 FPS/79.9% mIoU, 130 FPS/78.5% mIoU, and 180 FPS/70.1% mIoU, and leading performance on ADE20K as well.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 45

Similar Papers

DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network
Yongfeng Xing ... Luo Zhong
Mathematical Problems in Engineering | VOL. 2022
Yongfeng Xing, et. al.Yongfeng Xing ... Luo Zhong
06 Jun 2022
Mathematical Problems in Engineering | VOL. 2022

XANet: An Efficient Remote Sensing Image Segmentation Model Using Element-Wise Attention Enhancement and Multi-Scale Attention Fusion
Chenbin Liang ... Bo Cheng
Remote sensing | VOL. 15
Chenbin Liang, et. al.Chenbin Liang ... Bo Cheng
31 Dec 2022
Remote sensing | VOL. 15

A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet
Xiaolei Wang ... Zirong Hu
Scientific Reports | VOL. 13
Xiaolei Wang, et. al.Xiaolei Wang ... Zirong Hu
10 May 2023
Scientific Reports | VOL. 13

Research Contribution and Comprehensive Review towards the Semantic Segmentation of Aerial Images Using Deep Learning Techniques
P. Anilkumar ... Mamoun Alazab
Security and Communication Networks | VOL. 2022
P. Anilkumar, et. al.P. Anilkumar ... Mamoun Alazab
20 Mar 2022
Security and Communication Networks | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence