Abstract

We propose a new approach for Extreme States Classification (ESC) on feature spaces of facial cues in sign language (SL) videos. The method is built upon Active Appearance Model (AAM) face tracking and feature extraction from global and local AAMs. ESC is applied to various facial cues, such as pose rotations, head movements and eye blinking, leading to the detection of extreme states such as left/right, up/down and open/closed. Given the importance of such facial events in SL analysis, we apply ESC to detect visual events in SL videos from both American (ASL) and Greek (GSL) corpora, yielding promising qualitative and quantitative results. Further, we show the potential of ESC for assistive annotation tools and demonstrate how the detections relate to indicative higher-level linguistic events. Given the lack of facial annotated data and the fact that manual annotation is highly time-consuming, the ESC results indicate that the framework can have a significant impact on SL processing and analysis.
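
The abstract does not spell out the classification rule, so the following is only a minimal sketch of the general idea, under the assumption that each frame yields a single scalar cue derived from the AAM parameters (for example a yaw-related shape coefficient or a normalized eyelid opening): frames whose cue values fall in the tails of the empirical distribution are labeled as the two extreme states, everything else as neutral. The function name, the percentile thresholds and the cue itself are illustrative assumptions, not the authors' formulation.

```python
import numpy as np

def classify_extreme_states(cue, low_pct=10.0, high_pct=90.0):
    """Illustrative sketch only (not the paper's exact algorithm).

    Labels each frame of a scalar AAM-derived cue as
    -1 (extreme low, e.g. left / down / closed),
     0 (neutral), or
    +1 (extreme high, e.g. right / up / open),
    using the tails of the cue's empirical distribution.
    """
    cue = np.asarray(cue, dtype=float)
    lo, hi = np.percentile(cue, [low_pct, high_pct])  # tail thresholds
    labels = np.zeros(cue.shape, dtype=int)
    labels[cue <= lo] = -1
    labels[cue >= hi] = 1
    return labels

# Toy usage with a synthetic "yaw" cue over a few frames.
yaw = [0.0, -0.8, -0.9, 0.1, 0.05, 0.9, 1.0, 0.02, -0.1]
print(classify_extreme_states(yaw))  # -1 for the most leftward frames, +1 for the most rightward
```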

Highlights

  • Facial events are inextricably linked with human communication and are essential for gesture and sign language (SL) comprehension

  • In the experimental results, we present qualitative results on Greek sign language (GSL), which lacks annotations (Section 6.1); a quantitative comparison between Extreme States Classification (ESC), supervised classification and k-means clustering on American sign language (ASL) (Section 6.2); a quantitative evaluation of the effect of Active Appearance Model (AAM) fitting accuracy on ESC performance (Section 6.3); and a subject-independent application on the IMM database (Section 6.4) (a toy sketch of such a clustering baseline follows this list)

  • Even though the task is easier (the IMM data contain clearer extreme poses than SL videos), these results indicate that ESC is subject-independent
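
As a rough illustration of the kind of unsupervised baseline referred to in the comparison above, the sketch below clusters a scalar cue with a simple 1-D 2-means and reports which frames fall in the lower-mean cluster; the eyelid-distance cue, the initialization and the two-cluster setup are assumptions for illustration only, not the paper's experimental protocol.

```python
import numpy as np

def two_means_1d(cue, n_iter=50):
    """Illustrative 1-D 2-means baseline (not the paper's setup).

    Clusters a scalar cue into two groups and returns 0/1 labels,
    where 0 is the cluster with the smaller mean."""
    x = np.asarray(cue, dtype=float)
    centers = np.array([x.min(), x.max()])              # simple initialization
    for _ in range(n_iter):
        labels = np.abs(x[:, None] - centers[None, :]).argmin(axis=1)
        for k in (0, 1):
            if np.any(labels == k):
                centers[k] = x[labels == k].mean()       # update cluster means
    order = np.argsort(centers)                          # force 0 = lower-mean cluster
    return order.argsort()[labels]

# Toy usage: an "eyelid distance" cue where small values suggest closed eyes.
eyelid = [0.9, 0.85, 0.1, 0.08, 0.88, 0.12, 0.9]
print(two_means_1d(eyelid))  # 0 for the (assumed) closed-eye frames, 1 otherwise
```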

Introduction

Facial events are inextricably linked with human communication and are essential for gesture and sign language (SL) comprehension. From both the automatic visual processing and the recognition viewpoints, facial events are difficult to detect, describe and model. We focus on the detection of such low-level visual events in video sequences, which can prove important both for SL analysis and for automatic SL recognition (ASLR) [5,6]. SL video corpora are widely employed by linguists, annotators and computer scientists for the study of SL and the training of ASLR systems. All of the above require manual annotation of facial events, either for linguistic analysis or for ground-truth transcriptions, and this has led to efforts towards the development of automatic or semi-automatic annotation tools [12,13,14] for the processing of corpora.
