Dense Pixel-Level Interpretation of Dynamic Scenes With Video Panoptic Segmentation.

Dahun Kim, Sanghyun Woo, In So Kweon, Joon-Young Lee

doi:10.1109/tip.2022.3183440

Abstract

A holistic understanding of dynamic scenes is of fundamental importance in real-world computer vision problems such as autonomous driving, augmented reality, and spatio-temporal reasoning. In this paper, we propose a new computer vision benchmark: Video Panoptic Segmentation (VPS). To study this important problem, we present two datasets, Cityscapes-VPS and VIPER, together with a new evaluation metric, video panoptic quality (VPQ). We also propose VPSNet++, an advanced video panoptic segmentation network that simultaneously performs classification, detection, segmentation, and tracking of all identities in videos. Specifically, VPSNet++ builds upon a top-down panoptic segmentation network by adding a pixel-level feature fusion head and an object-level association head. The former temporally augments the pixel features, while the latter performs object tracking. Furthermore, we propose panoptic boundary learning as an auxiliary task, and instance discrimination learning, which learns spatio-temporally clustered pixel embeddings for individual thing or stuff regions, i.e., exactly the objective of the video panoptic segmentation problem. Our VPSNet++ significantly outperforms the default VPSNet, i.e., the FuseTrack baseline, and achieves state-of-the-art results on both the Cityscapes-VPS and VIPER datasets. The datasets, metric, and models are publicly available at https://github.com/mcahny/vps.
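
The abstract names VPQ but does not define it. As a rough illustration only, the sketch below assumes VPQ takes the familiar panoptic-quality form applied to "tubes" (a segment's pixels collected over a window of frames): a predicted tube matches a ground-truth tube of the same class when their IoU exceeds 0.5, and each class is scored as the summed IoU of matches divided by |TP| + ½|FP| + ½|FN|. The function names, the `(class_id, tube)` data layout, and the single-window scope are assumptions made for this example, not the authors' released evaluation code.

```python
from collections import defaultdict


def tube_iou(a, b):
    """IoU of two tubes, each a set of (frame, row, col) pixel coordinates."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def vpq_sketch(gt, pred, iou_threshold=0.5):
    """PQ-style score over tubes for one temporal window (hypothetical helper).

    gt, pred: lists of (class_id, tube) pairs, where tube is a set of
    (frame, row, col) tuples covering the segment across the window.
    """
    iou_sum = defaultdict(float)
    tp = defaultdict(int)
    fp = defaultdict(int)
    fn = defaultdict(int)
    matched_pred = set()

    # Match each ground-truth tube to an unmatched prediction of the same class.
    for g_cls, g_tube in gt:
        hit = False
        for j, (p_cls, p_tube) in enumerate(pred):
            if p_cls != g_cls or j in matched_pred:
                continue
            iou = tube_iou(g_tube, p_tube)
            if iou > iou_threshold:
                iou_sum[g_cls] += iou
                tp[g_cls] += 1
                matched_pred.add(j)
                hit = True
                break
        if not hit:
            fn[g_cls] += 1  # ground-truth tube with no match

    # Any prediction left unmatched counts as a false positive for its class.
    for j, (p_cls, _) in enumerate(pred):
        if j not in matched_pred:
            fp[p_cls] += 1

    # Average the per-class scores over classes that appear at least once.
    classes = set(tp) | set(fp) | set(fn)
    scores = [iou_sum[c] / (tp[c] + 0.5 * fp[c] + 0.5 * fn[c]) for c in classes]
    return sum(scores) / len(scores) if scores else 0.0


# Toy example: two frames, one well-tracked class-0 tube and one missed class-1 tube.
gt = [(0, {(0, 0, 0), (0, 0, 1), (1, 0, 0), (1, 0, 1)}), (1, {(0, 1, 0)})]
pred = [(0, {(0, 0, 0), (0, 0, 1), (1, 0, 0)}), (1, {(0, 1, 1)})]
score = vpq_sketch(gt, pred)
```

In this toy case the class-0 tube matches with IoU 0.75, while class 1 contributes one false negative and one false positive, so the class average is 0.375. Because a tube spans several frames, a prediction whose identity switches mid-window loses overlap with the ground-truth tube, which is how a tube-level score can penalize tracking errors as well as segmentation errors.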

Journal: IEEE Transactions on Image Processing: A Publication of the IEEE Signal Processing Society
Publication Date: Jan 1, 2022
Citations: 1

Similar Papers

A Fast Panoptic Segmentation Network for Self-Driving Scene Understanding
Abdul Majid ... Amina Jameel
Computer Systems Science and Engineering | VOL. 43
01 Jan 2021

Descriptor Extraction and Distance Metric Learning for a Robust Person Re-Identification System
David Nigel Fernando ... Rhandley Cajote
01 Oct 2018

Common Methods of Image Panoptic Segmentation Based on Deep Learning
Congcong Wang
22 Jan 2021

Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Jieru Mei ... Xinchen Yan
01 Jan 2021
