Object removal from complex videos using a few annotations

Simon Masnou, Andrés Almansa, Thuc Trinh Le, Yann Gousseau

doi:10.1007/s41095-019-0145-0

Abstract

We present a system for the removal of objects from videos. As input, the system only needs a user to draw a few strokes on the first frame, roughly delimiting the objects to be removed. To the best of our knowledge, this is the first system allowing the semi-automatic removal of objects from videos with complex backgrounds. The key steps of our system are the following: after initialization, segmentation masks are first refined and then automatically propagated through the video. Missing regions are then synthesized using video inpainting techniques. Our system can deal with multiple, possibly crossing objects, with complex motions, and with dynamic textures. This results in a computational tool that can alleviate tedious manual operations for editing high-quality videos.

Highlights

We propose a system to remove one or more objects from a video, starting with only a few user annotations.
We evaluate our method on various datasets, for both object segmentation and object removal.
Extracting the space–time segments that correspond to objects is a widely studied topic, and a complete review of it is beyond the scope of this paper.

Summary

Introduction

We propose a system to remove one or more objects from a video, starting with only a few user annotations. For the first-frame annotation, we use a classical strategy relying on a CNN-based edge detector, followed by a watershed transform yielding super-pixels, which are then selected by the user to refine the segmentation mask. After this step, a label is given to each object. To synthesize the missing regions, we employ two strategies: motion-based pixel propagation for the static background, and patch-based video completion for dynamic textures. Both methods rely heavily on the knowledge of segmented objects. This interplay between object segmentation and the completion scheme improves the method in several ways: it allows for better video stabilization, for a faster and more accurate search for similar patches, and for more accurate foreground–background separation. These improvements yield completion results with very little or no temporal incoherence. A shorter version of this work can be found in Ref. [2].
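The super-pixel selection step described above can be illustrated with a minimal sketch. It assumes the super-pixel label map has already been computed (in the paper, by a CNN-based edge detector followed by a watershed transform, which is not reproduced here) and that user strokes are given as a list of pixel coordinates. The function names `select_superpixels` and `mask_from_superpixels`, the toy label map, and the rule "a super-pixel is selected if any stroke pixel falls inside it" are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Set, Tuple

Grid = List[List[int]]  # row-major 2D grid of integers


def select_superpixels(labels: Grid, strokes: List[Tuple[int, int]]) -> Set[int]:
    """Return the ids of super-pixels touched by at least one stroke pixel.

    Assumption: a super-pixel counts as selected as soon as one stroke
    pixel (row, col) lies inside it.
    """
    height, width = len(labels), len(labels[0])
    selected: Set[int] = set()
    for row, col in strokes:
        # Ignore stroke pixels that fall outside the frame.
        if 0 <= row < height and 0 <= col < width:
            selected.add(labels[row][col])
    return selected


def mask_from_superpixels(labels: Grid, selected: Set[int]) -> Grid:
    """Build a binary mask: 1 where the pixel belongs to a selected super-pixel."""
    return [[1 if lab in selected else 0 for lab in row] for row in labels]


# Toy 4x6 label map with three super-pixels (ids 0, 1, 2); hypothetical data.
labels = [
    [0, 0, 1, 1, 2, 2],
    [0, 0, 1, 1, 2, 2],
    [0, 1, 1, 1, 2, 2],
    [0, 0, 1, 2, 2, 2],
]
strokes = [(1, 2), (2, 3)]  # two stroke pixels, both inside super-pixel 1

selected = select_superpixels(labels, strokes)
mask = mask_from_superpixels(labels, selected)
for row in mask:
    print(row)
```

In this toy case the resulting mask covers exactly the pixels of super-pixel 1. Working at the super-pixel level means a few rough strokes are enough to obtain a mask that follows the detected edges, which is the point of the refinement step.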

Video object segmentation

Video editing

Video inpainting

Proposed method

First frame annotation

Object segmentation

Semantic segmentation networks

Multiple object tracking

Object removal

Dynamic background

Results


Journal: Computational Visual Media
Publication Date: Aug 22, 2019
Citations: 14
License type: open-access


