Learning Video Object Segmentation with Visual Memory

Pavel Tokmakov,Karteek Alahari,Cordelia Schmid

doi:10.1109/iccv.2017.480

Abstract

This paper addresses the task of segmenting moving objects in unconstrained videos. We introduce a novel two-stream neural network with an explicit module to achieve this. The two streams of the network encode spatial and temporal features in a video sequence respectively, while the module captures the evolution of objects over time. The module to build a “visual memory” in video, i.e., a joint representation of all the video frames, is realized with a convolutional recurrent unit learned from a small number of training video sequences. Given a video frame as input, our approach assigns each pixel an object or background label based on the learned spatio-temporal features as well as the memory specific to the video, acquired automatically without any manually-annotated frames. The visual is implemented with convolutional gated recurrent units, which allows to propagate spatial information over time. We evaluate our method extensively on two benchmarks, DAVIS and Freiburg-Berkeley motion segmentation datasets, and show state-of-the-art results. For example, our approach outperforms the top method on the DAVIS dataset by nearly 6%. We also provide an extensive ablative analysis to investigate the influence of each component in the proposed framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Video Object Segmentation with Visual Memory

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Learning to Segment Moving Objects
Pavel Tokmakov ... Karteek Alahari
International Journal of Computer Vision | VOL. 127
Pavel Tokmakov, et. al.Pavel Tokmakov ... Karteek Alahari
22 Sep 2018
International Journal of Computer Vision | VOL. 127

Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks.
Ziyi Liu ... Qilin Zhang
IEEE Transactions on Image Processing | VOL. 27
Ziyi Liu, et. al.Ziyi Liu ... Qilin Zhang
30 Jul 2018
IEEE Transactions on Image Processing | VOL. 27

Learning Affective Video Features for Facial Expression Recognition via Hybrid Deep Learning
Shiqing Zhang ... Limei Liu
IEEE Access | VOL. 7
Shiqing Zhang, et. al.Shiqing Zhang ... Limei Liu
01 Jan 2019
IEEE Access | VOL. 7

Spectral and Temporal Feature Learning With Two-Stream Neural Networks for Mental Workload Assessment.
Pengbo Zhang ... Wei You
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 27
Pengbo Zhang, et. al.Pengbo Zhang ... Wei You
26 Apr 2019
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Video Object Segmentation with Visual Memory

Abstract

Talk to us

Similar Papers