Abstract
In this article, we propose a novel framework for camouflaged object detection (COD), named D$^{2}$C-Net, which contains two new modules: dual-branch feature extraction (DFE) and gradually refined cross fusion (GRCF). Specifically, the DFE simulates the two-stage detection process of the human visual mechanism when observing camouflaged scenes. In the first stage, dense concatenation is employed to aggregate multilevel features and expand the receptive field. The first-stage feature maps are then used to extract two-direction guidance information, which benefits the second stage. The GRCF consists of a self-refine attention unit and a cross-refinement unit, and combines the peer-layer features with the DFE features to improve COD performance. The proposed framework outperforms 13 state-of-the-art deep learning-based methods on three public datasets in terms of five widely used metrics. Finally, we show evidence for the successful application of the proposed method to surface defect detection and medical image segmentation.
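As a rough illustration of what aggregating multilevel features by dense concatenation can look like, the sketch below merges each feature level with all coarser levels after upsampling them to its resolution. This is an assumption made for illustration only: the abstract does not specify the connection pattern, the upsampling method, or any convolutions, and the names `upsample_nearest` and `dense_concat` are hypothetical, not part of D$^{2}$C-Net.

```python
from typing import List

# A feature map is stored as channels x height x width nested lists.
FeatureMap = List[List[List[float]]]


def upsample_nearest(fmap: FeatureMap, factor: int) -> FeatureMap:
    """Nearest-neighbour upsampling of every channel by an integer factor."""
    out = []
    for channel in fmap:
        rows = []
        for row in channel:
            # Repeat each value `factor` times along the width ...
            wide = [v for v in row for _ in range(factor)]
            # ... and repeat the widened row `factor` times along the height.
            rows.extend(list(wide) for _ in range(factor))
        out.append(rows)
    return out


def dense_concat(levels: List[FeatureMap]) -> List[FeatureMap]:
    """Concatenate each level with all coarser levels (assumed pattern).

    levels[0] is the finest map; level j is assumed to have 1 / 2**j of
    the spatial resolution of level 0.
    """
    aggregated = []
    for i, fine in enumerate(levels):
        merged = [[list(row) for row in ch] for ch in fine]  # copy own channels
        for j in range(i + 1, len(levels)):
            # Bring the coarser level j up to the resolution of level i.
            merged.extend(upsample_nearest(levels[j], 2 ** (j - i)))
        aggregated.append(merged)
    return aggregated


# Toy input: 1 channel at 4x4, 2 channels at 2x2, 3 channels at 1x1.
levels = [
    [[[0.0] * 4 for _ in range(4)]],
    [[[1.0, 2.0], [3.0, 4.0]] for _ in range(2)],
    [[[5.0]] for _ in range(3)],
]
fused = dense_concat(levels)
print([len(f) for f in fused])  # channel count per aggregated level
```

In a real network the concatenated channels would typically pass through learned convolutions; the sketch only shows the shape bookkeeping of the aggregation step.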