Abstract

Semi-supervised Video Object Segmentation (VOS) needs to establish pixel-level correspondences between a video frame and preceding segmented frames to leverage their segmentation clues. Most works rely on features at a single scale to establish those correspondences, e.g., performing dense matching with Convolutional Neural Network (CNN) features from a deep layer. In contrast, this work explores complementary features at different scales to pursue more robust feature matching. Coarse features from a deep layer are first used to obtain coarse pixel-level correspondences. We then evaluate the quality of those correspondences and select pixels with low-quality matches for fine-scale feature matching. Segmentation clues from previous frames are propagated through both the coarse and fine-scale correspondences, and fused with appearance features for object segmentation. Compared with previous works, this coarse-to-fine matching scheme is more robust to distraction from similar objects and better preserves object details. The sparsity of the fine-scale matching also ensures a fast inference speed. On popular VOS datasets including DAVIS and YouTube-VOS, the proposed method shows promising performance compared with recent works.
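
To make the coarse-to-fine matching idea concrete, below is a minimal sketch of how such a scheme could look. It is not the paper's implementation: the tensor shapes, the use of max cosine similarity as a match-confidence measure, the softmax temperature, the confidence threshold, and the assumption that coarse and fine feature maps have been resampled to the same spatial grid are all illustrative assumptions.

```python
# Hedged sketch of coarse-to-fine correspondence matching for mask propagation.
# Assumes coarse and fine feature maps share the same (flattened) spatial grid.
import torch
import torch.nn.functional as F


def propagate_labels(feat_q, feat_m, labels_m, temperature=0.07):
    """Dense matching: propagate memory-frame labels to query pixels.

    feat_q:   (C, Nq) L2-normalized query-frame features
    feat_m:   (C, Nm) L2-normalized memory-frame features
    labels_m: (K, Nm) per-object soft masks of the memory frame
    Returns:  (K, Nq) propagated soft masks, (Nq,) per-pixel match confidence.
    """
    affinity = feat_q.t() @ feat_m                   # (Nq, Nm) cosine similarities
    weights = F.softmax(affinity / temperature, dim=1)
    propagated = labels_m @ weights.t()              # (K, Nq) label propagation
    confidence = affinity.max(dim=1).values          # best-match similarity per pixel
    return propagated, confidence


def coarse_to_fine(coarse_q, coarse_m, fine_q, fine_m, labels_m, conf_thresh=0.5):
    # 1) Coarse matching with deep-layer features for every query pixel.
    masks, conf = propagate_labels(coarse_q, coarse_m, labels_m)
    # 2) Select query pixels whose coarse correspondence looks unreliable.
    unreliable = conf < conf_thresh                  # (Nq,) boolean mask
    if unreliable.any():
        # 3) Re-match only those pixels against fine-scale (shallow-layer)
        #    features; sparsity here keeps the extra cost small.
        fine_masks, _ = propagate_labels(fine_q[:, unreliable], fine_m, labels_m)
        masks[:, unreliable] = fine_masks
    return masks
```

In this sketch the fine-scale pass touches only the low-confidence subset of query pixels, which mirrors the abstract's point that sparse fine-scale matching keeps inference fast while recovering details the coarse matching misses.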
