Abstract
As robots become ubiquitous and indispensable in everyday environments, such as homes, offices, healthcare facilities, schools, and manufacturing shop floors, efficient and safe collaboration and cohabitation become imperative. Such environments could therefore benefit greatly from accurate human action prediction. In addition to being accurate, human action prediction should be computationally efficient, to ensure a timely reaction, and capable of coping with changing environments, since unstructured interaction and collaboration with humans rarely take place under static conditions. In this paper, we propose a model for human action prediction based on motion cues and gaze, using shared-weight Long Short-Term Memory networks (LSTMs) and feature dimensionality reduction. LSTMs have proven to be a powerful tool for processing time series data, especially when dealing with long-term dependencies; however, to maximize their performance, LSTM networks should be fed informative, high-quality inputs. We therefore also conducted an extensive input feature analysis based on (i) signal correlation and the strength of each signal as a stand-alone predictor, and (ii) a multilayer perceptron inspired by the autoencoder architecture. We validated the proposed model on the publicly available MoGaze dataset (https://humans-to-robots-motion.github.io/mogaze/) for human action prediction, as well as on a smaller dataset recorded in our laboratory. Our model outperformed alternatives, such as recurrent neural networks, a fully connected LSTM network, and the strongest stand-alone signals (baselines), and can run in real time on a standard laptop CPU. Since eye gaze might not always be available in a real-world scenario, we implemented and tested a multilayer perceptron for gaze estimation from more easily obtainable motion cues, such as head orientation and hand position. The estimated gaze signal can be used during inference of our LSTM-based model, making our action prediction pipeline suitable for real-time practical applications.
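To make the described pipeline concrete, below is a minimal sketch in PyTorch of the two components the abstract names: a shared-weight LSTM predictor that reuses a single LSTM across every input cue stream before a classification head, and a small multilayer perceptron that estimates gaze direction from head orientation and hand position. All class names, layer sizes, input dimensions, and the concatenation-based fusion are illustrative assumptions for this sketch, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class SharedWeightLSTMPredictor(nn.Module):
    # One LSTM whose weights are reused across every input cue stream,
    # followed by a linear head that fuses the per-cue summaries into
    # action-class scores. Sizes and fusion scheme are assumptions.
    def __init__(self, cue_dim, n_cues, hidden_dim=64, n_actions=6):
        super().__init__()
        self.lstm = nn.LSTM(cue_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(n_cues * hidden_dim, n_actions)

    def forward(self, cues):
        # cues: list of (batch, time, cue_dim) tensors, one per cue.
        summaries = []
        for x in cues:
            _, (h_n, _) = self.lstm(x)   # identical weights for each cue
            summaries.append(h_n[-1])    # final hidden state as cue summary
        return self.head(torch.cat(summaries, dim=-1))

class GazeEstimatorMLP(nn.Module):
    # Estimates a unit 3-D gaze direction from easier-to-obtain cues;
    # here we assume a head-orientation quaternion (4) plus a 3-D hand
    # position, i.e., a 7-D input. The 7-D layout is hypothetical.
    def __init__(self, in_dim=7, hidden=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 3),
        )

    def forward(self, x):
        g = self.net(x)
        # Normalize so the output is a direction vector.
        return g / g.norm(dim=-1, keepdim=True).clamp_min(1e-8)

# Hypothetical usage: three cue streams, 2 s of data sampled at 60 Hz.
model = SharedWeightLSTMPredictor(cue_dim=3, n_cues=3)
cues = [torch.randn(8, 120, 3) for _ in range(3)]      # batch of 8
logits = model(cues)                                    # (8, n_actions)

gaze_net = GazeEstimatorMLP()
gaze = gaze_net(torch.randn(8, 7))                      # (8, 3) unit vectors
```

At inference time, the estimated gaze vector would simply replace the measured gaze channel in the list of cue streams fed to the predictor, which is what makes the pipeline usable without an eye tracker.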