Abstract
Foreground segmentation models are designed to extract moving objects of varying sizes from the background, which can benefit from representations of various scales. As an effective module for capturing multi-scale contexts, Atrous Spatial Pyramid Pooling (ASPP) convolves a final feature representation via multiple parallel atrous convolutions with different dilation rates. However, as the dilation rate increases, ASPP gradually loses its large-scale modeling ability because the sampling of atrous kernel becomes progressively sparse within the receptive field. To solve this problem, we design a CompactASPP module to convolve feature maps compactly. Without significantly increasing the module size, the CompactASPP can encode multi-scale features from all neurons within the receptive field rather than from neurons in several sparsely distributed positions. Furthermore, we leverage CompactASPP modules to enhance our previous X-Net. The proposed Fast X-Net substantially improves the segmentation speed by over 63.6% and attains new state-of-the-art performances on CDnet2014, SBI2015 and UCSD benchmarks.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Engineering Applications of Artificial Intelligence
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.