Learning to Segment Moving Objects

Pavel Tokmakov,Cordelia Schmid,Karteek Alahari

doi:10.1007/s11263-018-1122-2

Abstract

We study the problem of segmenting moving objects in unconstrained videos. Given a video, the task is to segment all the objects that exhibit independent motion in at least one frame. We formulate this as a learning problem and design our framework with three cues: (i) independent object motion between a pair of frames, which complements object recognition, (ii) object appearance, which helps to correct errors in motion estimation, and (iii) temporal consistency, which imposes additional constraints on the segmentation. The framework is a two-stream neural network with an explicit memory module. The two streams encode appearance and motion cues in a video sequence respectively , while the memory module captures the evolution of objects over time, exploiting the temporal consistency. The motion stream is a convolutional neural network trained on synthetic videos to segment independently moving objects in the optical flow field. The module to build a 'visual memory' in video, i.e., a joint representation of all the video frames, is realized with a convolutional recurrent unit learned from a small number of training video sequences. For every pixel in a frame of a test video, our approach assigns an object or background label based on the learned spatio-temporal features as well as the 'visual memory' specific to the video. We evaluate our method extensively on three benchmarks, DAVIS, Freiburg-Berkeley motion seg-mentation dataset and SegTrack. In addition, we provide an extensive ablation study to investigate both the choice of the training data and the influence of each component in the proposed framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning to Segment Moving Objects

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision

Lead the way for us

Journal: International Journal of Computer Vision	Publication Date: Sep 22, 2018
Citations: 96

Similar Papers

Learning Video Object Segmentation with Visual Memory
Pavel Tokmakov ... Cordelia Schmid
-
Pavel Tokmakov, et. al.Pavel Tokmakov ... Cordelia Schmid
10 Aug 2017
10 Aug 2017

MOCA: Memory Object Classification and Allocation in Heterogeneous Memory Systems
Aditya Narayan ... Shaizeen Aga
-
Aditya Narayan, et. al.Aditya Narayan ... Shaizeen Aga
01 May 2018
01 May 2018

Modification of spatial recognition memory and object discrimination after chronic administration of haloperidol, amitriptyline, sodium valproate or olanzapine in normal and anhedonic rats
Marco Orsetti ... Piera Ghi
The International Journal of Neuropsychopharmacology | VOL. 10
Marco Orsetti, et. al.Marco Orsetti ... Piera Ghi
31 May 2006
The International Journal of Neuropsychopharmacology | VOL. 10

Subpixel Motion Estimation for Super-Resolution Image Sequence Enhancement
Richard R Schultz ... Robert L Stevenson
Journal of Visual Communication and Image Representation | VOL. 9
Richard R Schultz, et. al.Richard R Schultz ... Robert L Stevenson
01 Mar 1998
Journal of Visual Communication and Image Representation | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning to Segment Moving Objects

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision