An analysis of Convolutional Long Short-Term Memory Recurrent Neural Networks for gesture recognition

Eleni Tsironi,Pablo Barros,Cornelius Weber,Stefan Wermter

doi:10.1016/j.neucom.2016.12.088

Eleni Tsironi, Pablo Barros + Show 2 more

Open Access

https://doi.org/10.1016/j.neucom.2016.12.088

Copy DOI

Journal: Neurocomputing	Publication Date: May 2, 2017
Citations: 210	License type: cc-by-nc-nd

Affiliation: Universität Hamburg

Abstract

In this research, we analyze a Convolutional Long Short-Term Memory Recurrent Neural Network (CNNLSTM) in the context of gesture recognition. CNNLSTMs are able to successfully learn gestures of varying duration and complexity. For this reason, we analyze the architecture by presenting a qualitative evaluation of the model, based on the visualization of the internal representations of the convolutional layers and on the examination of the temporal classification outputs at a frame level, in order to check if they match the cognitive perception of a gesture. We show that CNNLSTM learns the temporal evolution of the gestures classifying correctly their meaningful part, known as Kendon’s stroke phase. With the visualization, for which we use the deconvolution process that maps specific feature map activations to original image pixels, we show that the network learns to detect the most intense body motion. Finally, we show that CNNLSTM outperforms both plain CNN and LSTM in gesture recognition.

Full Text