Abstract
Personal videos often contain visual distractors: objects that are captured accidentally and draw viewers' attention away from the main subjects. We propose a method to automatically detect and localize these distractors by learning from a manually labeled dataset. To achieve spatially and temporally coherent detection, we extract features at the temporal-superpixel level and classify them within a traditional support vector machine (SVM) learning framework. We also experiment with end-to-end learning using convolutional neural networks, which achieves slightly higher performance than the other methods. The classification result is further refined in a post-processing step based on graph-cut optimization. Experimental results show that our method achieves an accuracy of 81% and a recall of 86%. We demonstrate several ways of removing the detected distractors to improve video quality, including video hole filling, video frame replacement, and camera path re-planning. User study results show that our method significantly improves the aesthetic quality of videos.
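The core classification stage described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature vectors, labels, and train/test split are synthetic stand-ins for the per-temporal-superpixel descriptors the paper extracts, and scikit-learn's `SVC` is used as a generic SVM. The predicted per-superpixel probabilities are what a graph-cut refinement step would subsequently smooth.

```python
import numpy as np
from sklearn.svm import SVC

# Hypothetical stand-ins for per-temporal-superpixel feature vectors
# (e.g. color, motion, and position descriptors); labels: 1 = distractor.
rng = np.random.default_rng(0)
n_segments = 200
features = rng.normal(size=(n_segments, 16))
labels = (features[:, 0] + 0.1 * rng.normal(size=n_segments) > 0).astype(int)

# Train an SVM classifier on labeled superpixels, as in a traditional
# SVM-based learning framework.
clf = SVC(kernel="rbf", probability=True, random_state=0)
clf.fit(features[:150], labels[:150])

# Per-superpixel distractor probabilities; in the full pipeline these
# would serve as unary terms for a graph-cut refinement step.
probs = clf.predict_proba(features[150:])[:, 1]
accuracy = clf.score(features[150:], labels[150:])
```

In the actual method, spatial and temporal coherence comes from both the temporal-superpixel representation itself and the graph-cut post-processing, which penalizes label disagreement between neighboring superpixels.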