Abstract

Existing real-time semantic segmentation models usually pursue lower inference costs by compressing spatial resolution, reducing the number of feature extraction layers, or simplifying the high-resolution generation process. This inevitably leads to huge information loss and ultimately affects segmentation accuracy. To address the problem, a HoloParser is proposed to learn and parse the input image from a holistic perspective of the entire segmentation pipeline. First, in the image preprocessing part, the often-overlooked problem of structural information loss in fast downsampling is highlighted and studied. For this, a nonstrided fast downsampling (NFD) module is devised for holo-structure retention. Second, for the resolution change processes in segmentation networks, the spatial correlation between up/down-samplings is revisited and focused. And a lossless sample pair (LSP) is proposed for holo-correlation modeling and retrieving. Third, in feature extraction, after reconsidering the necessity and redundancy of multiscale learning, a distributed pyramid learning (DPL) module is proposed for lightweight holo-feature extraction with composite receptive fields (RFs). Finally, for the high-resolution output generation, a parallel resolution maintenance (PRM) module is put forward via high-low semantics aggregation across stages for holo-semantics utilization. Extensive experiments demonstrate the effectiveness and superiority of the proposed modules. The HoloParser achieves state-of-the-art performance in real-time semantic segmentation, which reports 76.45% mIoU at 280+ frames/s on the Cityscapes dataset.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.