Zero-Shot Deep Learning for Media Mining: Person Spotting and Face Clustering in Video Big Data

Mohamed Abdallah, Mohammad Ragab, Hyungwon Kim, Elsayed Hemayed

doi:10.3390/electronics8121394

Abstract

Analyzing frame sequences in talk show videos, which is necessary for media mining and television production, requires significant manual effort and is very time-consuming. Given the vast number of unlabeled face frames in talk show videos, we address the problem of recognizing and clustering faces. We propose a TV media mining system based on a deep convolutional neural network trained with a triplet loss minimization method. The main function of the system is to index and cluster video data, both to support effective media production analysis of individuals in talk show videos and to rapidly identify a specific individual in video data in real time. Our system uses several face datasets: Labeled Faces in the Wild (LFW), which is a collection of unlabeled web face images, YouTube Faces, and a talk show faces dataset. In the recognition (person spotting) task, our system achieves an F-measure of 0.996 on the LFW dataset and 0.972 on the talk show faces dataset. In the clustering task, it achieves an F-measure of 0.764 on the YouTube Faces database, 0.935 on the LFW dataset, and 0.832 on the talk show faces dataset, an improvement of 5.4%, 6.5%, and 8.2% over previous methods.
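The abstract states only that the network is trained by triplet loss minimization and gives no formula. As a minimal sketch, assuming the commonly used hinge form of the loss on squared Euclidean distances, a single triplet could be scored as below. The function names, the toy embeddings, and the margin of 0.2 are illustrative assumptions, not values taken from the paper.

```python
from typing import Sequence


def squared_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Squared Euclidean distance between two embedding vectors."""
    if len(u) != len(v):
        raise ValueError("embeddings must have the same dimension")
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(
    anchor: Sequence[float],
    positive: Sequence[float],
    negative: Sequence[float],
    margin: float = 0.2,  # assumed value, for illustration only
) -> float:
    """Hinge-style triplet loss for one (anchor, positive, negative) triple.

    The loss is zero once the negative is farther from the anchor than
    the positive by at least `margin` (in squared distance).
    """
    d_pos = squared_distance(anchor, positive)
    d_neg = squared_distance(anchor, negative)
    return max(d_pos - d_neg + margin, 0.0)


# Hypothetical 2-D embeddings; real face embeddings have many more dimensions.
anchor = [0.0, 1.0]
positive = [0.0, 0.5]  # same identity as the anchor
negative = [1.0, 0.0]  # different identity

easy = triplet_loss(anchor, positive, negative)  # well-separated triple
hard = triplet_loss(anchor, negative, positive)  # roles swapped: violates the margin
```

In this toy case the well-separated triple contributes no loss, while the swapped triple is penalized. Training would average such terms over many triplets and backpropagate through the embedding network, which is omitted here.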

Highlights

Many methods have been studied for producing, processing, and recording talk show videos effectively.
We present a TV media mining system based on deep convolutional neural network (DCNN) algorithms for face detection, face recognition, and face clustering.
DCNN-based face recognition and clustering approaches require a large volume of data and large face datasets for training.

Summary

Introduction

Many methods have been studied for producing, processing, and recording talk show videos effectively. A meaningful analysis of media content requires substantial manual effort. This problem arises in TV production analysis and media mining applications, where the number of faces of individuals can be on the order of millions. Many hours of talk shows are broadcast daily, and the majority of these talk shows contain millions of frames. To organize this volume of data properly, we consider clustering the face images into a few hundred discrete identities. A frame-based analysis is needed to make talk show videos searchable for identities (public figures) and useful for media mining and TV production analysis.
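The text above does not say which clustering algorithm or which F-measure variant the authors use. Purely as an illustration of grouping face embeddings into identities, the sketch below assumes a simple greedy centroid scheme with a distance threshold, and scores the result with a pairwise F-measure. The function names, the threshold, and the toy data are all hypothetical.

```python
import math
from itertools import combinations


def euclidean(u, v):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def greedy_cluster(embeddings, threshold):
    """Assign each embedding to the nearest centroid within `threshold`,
    or start a new cluster when no centroid is close enough."""
    centroids, sizes, labels = [], [], []
    for emb in embeddings:
        best, best_dist = None, threshold
        for cid, centroid in enumerate(centroids):
            dist = euclidean(emb, centroid)
            if dist <= best_dist:
                best, best_dist = cid, dist
        if best is None:
            centroids.append(list(emb))
            sizes.append(1)
            labels.append(len(centroids) - 1)
        else:
            sizes[best] += 1
            n = sizes[best]
            # Running mean keeps the centroid at the average of its members.
            centroids[best] = [c + (x - c) / n for c, x in zip(centroids[best], emb)]
            labels.append(best)
    return labels


def pairwise_f_measure(predicted, truth):
    """F-measure over all pairs of items: a pair is a true positive when
    both the predicted and the true labelings put the two items together."""
    tp = fp = fn = 0
    for i, j in combinations(range(len(truth)), 2):
        same_pred = predicted[i] == predicted[j]
        same_true = truth[i] == truth[j]
        if same_pred and same_true:
            tp += 1
        elif same_pred:
            fp += 1
        elif same_true:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical 2-D embeddings for two identities, "A" and "B".
faces = [[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [1.1, 1.0], [0.0, 0.1]]
truth = ["A", "A", "B", "B", "A"]

labels = greedy_cluster(faces, threshold=0.5)
score = pairwise_f_measure(labels, truth)
```

A greedy pass like this depends on the order of the frames and on the threshold, so it is only a starting point for thinking about the problem at the scale of millions of frames, not a description of the system's clustering stage.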

Journal: Electronics
Publication Date: Nov 22, 2019
Citations: 8
License type: CC BY 4.0
