Film segmentation and indexing using autoassociative neural networks

K Sreenivasa Rao,Shashidhar G Koolagudi,Dipanjan Nandi

doi:10.1007/s10772-013-9206-4

Abstract

In this paper, Autoassociative Neural Network (AANN) models are explored for segmentation and indexing the films (movies) using audio features. A two-stage method is proposed for segmenting the film into sequence of scenes, and then indexing them appropriately. In the first stage, music and speech plus music segments of the film are separated, and music segments are labelled as title and fighting scenes based on their position. At the second stage, speech plus music segments are classified into normal, emotional, comedy and song scenes. In this work, Mel frequency cepstral coefficients (MFCCs), zero crossing rate and intensity are used as audio features for segmentation and indexing the films. The proposed segmentation and indexing method is evaluated on manual segmented Hindi films. From the evaluation results, it is observed that title, fighting and song scenes are segmented and indexed without any errors, and most of the errors are observed in discriminating the comedy and normal scenes. Performance of the proposed AANN models used for segmentation and indexing of the films, is also compared with hidden Markov models, Gaussian mixture models and support vector machines.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Film segmentation and indexing using autoassociative neural networks

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Aug 28, 2013
Citations: 2

Similar Papers

Classification of sport videos using edge-based features and autoassociative neural network models
C Krishna Mohan ... B Yegnanarayana
Signal, Image and Video Processing | VOL. 4
C Krishna Mohan, et. al.C Krishna Mohan ... B Yegnanarayana
10 Dec 2008
Signal, Image and Video Processing | VOL. 4

Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
R Visalakshi ... P Dhanalakshmi
-
R Visalakshi, et. al.R Visalakshi ... P Dhanalakshmi
19 Dec 2015
19 Dec 2015

A study of Spoken Word Recognition using Unsupervised Learning with reference to Assamese Language
Dipen Nath ... Sanjib Kr Kalita
-
Dipen Nath, et. al.Dipen Nath ... Sanjib Kr Kalita
01 Mar 2019
01 Mar 2019

Speaker-specific information from residual phase
K Sri Rama Murty ... B Yegnanarayana
-
K Sri Rama Murty, et. al.K Sri Rama Murty ... B Yegnanarayana
11 Dec 2004
11 Dec 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Film segmentation and indexing using autoassociative neural networks

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology