An encoder‐decoder framework with dynamic convolution for weakly supervised instance segmentation

Liangjun Zhu,Zhongren Liu,Shuchen Ding,Li Peng

doi:10.1049/cvi2.12202

Liangjun Zhu, Zhongren Liu + Show 2 more

Open Access

https://doi.org/10.1049/cvi2.12202

Copy DOI

Abstract

AbstractIn the systems of industrial robotics and autonomous vehicles, instance segmentation is widely employed. However, manually labelling an object outline is time‐consuming. In order to reduce annotation costs, we present a weakly supervised instance segmentation method in this article. A deeply convolutional network is first used to construct multi‐scale feature maps for each object in the input image. After that, the encoder‐decoder framework with dynamic convolution is utilised to enhance model capacity and efficiency, while avoiding the issues of anchor design, proposal selection, and RoIAlign implementation. In particular, Dynamic Heads are used in the encoder to create dynamic convolution kernels, while Instance Heads are used in the decoder to provide the global feature map. With dynamic convolution, each instance can be segmented independently, reducing interference with other instances and improving segmentation accuracy. Under the supervision of projection loss and pixel point colour pairing loss, the contours of each object are finally outlined. On the PASCAL VOC and MS COCO datasets, the proposed method is competitive with more sophisticated approaches. In the VOC dataset, segmentation performance achieved 37.6% average precision with ResNet‐101 and FPN networks. The extensively visualised results demonstrate the effectiveness of the proposed encoder‐decoder framework with dynamic convolution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An encoder‐decoder framework with dynamic convolution for weakly supervised instance segmentation

Abstract

Talk to us

Similar Papers

More From: IET Computer Vision

Lead the way for us

Journal: IET Computer Vision	Publication Date: May 2, 2023
License type: CC BY-NC-ND 4.0

Similar Papers

HISFCOS: Half-Inverted Stage Block for Efficient Object Detection Based on Deep Learning.
Beomyeon Hwang ... Seunghyun Lee
Journal of Imaging | VOL. 8
Beomyeon Hwang, et. al.Beomyeon Hwang ... Seunghyun Lee
17 Apr 2022
Journal of Imaging | VOL. 8

Ganster R-CNN: Occluded Object Detection Network Based on Generative Adversarial Nets and Faster R-CNN
Kelei Sun ... Huaping Zhou
IEEE Access | VOL. 10
Kelei Sun, et. al.Kelei Sun ... Huaping Zhou
01 Jan 2021
IEEE Access | VOL. 10

An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network
Zhong Qu ... Sheng-Ye Wang
Image and Vision Computing | VOL. 125
Zhong Qu, et. al.Zhong Qu ... Sheng-Ye Wang
01 Sep 2022
Image and Vision Computing | VOL. 125

ICIoU: Improved Loss Based on Complete Intersection Over Union for Bounding Box Regression
Xufei Wang ... Jeongyoung Song
IEEE Access | VOL. 9
Xufei Wang, et. al.Xufei Wang ... Jeongyoung Song
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An encoder‐decoder framework with dynamic convolution for weakly supervised instance segmentation

Abstract

Talk to us

Similar Papers

More From: IET Computer Vision