Video Polyp Segmentation: A Deep Learning Perspective

Ge-Peng Ji, Kai Zhao, Deng-Ping Fan, Guobao Xiao, Yu-Cheng Chou, Luc Van Gool, Geng Chen

doi:10.1007/s11633-022-1371-y

Abstract

We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, progress in VPS has been slow, owing to the lack of a large-scale dataset with fine-grained segmentation annotations. To address this issue, we first introduce a high-quality, frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158,690 colonoscopy video frames from the well-known SUN-database. We provide additional annotations of diverse types, i.e., attribute, object mask, boundary, scribble, and polygon. Second, we design a simple but efficient baseline, named PNS+, which consists of a global encoder, a local encoder, and normalized self-attention (NS) blocks. The global and local encoders receive an anchor frame and multiple successive frames, respectively, to extract long-term and short-term spatial-temporal representations, which are then progressively refined by two NS blocks. Extensive experiments show that PNS+ achieves the best performance and real-time inference speed (170 fps), making it a promising solution for the VPS task. Third, we extensively evaluate 13 representative polyp/object segmentation models on our SUN-SEG dataset and provide attribute-based comparisons. Finally, we discuss several open issues and suggest possible research directions for the VPS community. Our project and dataset are publicly available at https://github.com/GewelsJI/VPS.

Journal: Machine Intelligence Research
Publication Date: Nov 3, 2022
Citations: 55
License type: open-access
