Abstract
Accuracy is of vital importance for real-time semantic segmentation. However, modern methods often weaken high-level or low-level feature extraction to promote inference speed, thereby resulting in poor accuracy. In this paper, we present a Multi-level Feature Enhancement Network (MFENet) to enhance the feature extraction of each level in backbone. This approach can achieve high performance while maintaining high inference speed. We first rely on a Spatial and Edge Extraction Module with the Laplace Operator to improve the edge information extraction of low-level features. Next, we design a Context Boost Module to increase the context information inside each object of high-level features. Finally, we introduce the Selective Refinement Module to selectively combine the information from these two modules. Our network attained precise real-time segmentation results on Cityscapes, CamVid and COCO-Stuff datasets. More specifically, the architecture achieved 76.7% Mean IoU on the Cityscapes test dataset with 12.5 GFLOPS and a speed of 47 FPS on one NVIDIA Titan Xp card, which is more accurate than existing real-time methods.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.