Action Anticipation from Multimodal Data

Giovanni Farinella,Sebastiano Battiato,Tiziana Rotondo,Valeria Tomaselli

doi:10.5220/0007379001540161

Abstract

The idea of multi-sensor data fusion is to combine the data coming from different sensors to provide more accurate and complementary information to solve a specific task. Our goal is to build a shared representation related to data coming from different domains, such as images, audio signal, heart rate, acceleration, etc., in order to anticipate daily activities of a user wearing multimodal sensors. To this aim, we consider the Stanford-ECM Dataset which contains syncronized data acquired with different sensors: video, acceleration and heart rate signals. The dataset is adapted to our action prediction task by identifying the transitions from the generic “Unknown” class to a specific “Activity”. We discuss and compare a Siamese Network with the Multi Layer Perceptron and the 1D CNN where the input is an unknown observation and the output is the next activity to be observed. The feature representations obtained with the considered deep architecture are classified with SVM or KNN classifiers. Experimental results pointed out that prediction from multimodal data seems a feasible task, suggesting that multimodality improves both classification and prediction. Nevertheless, the task of reliably predicting next actions is still open and requires more investigations as well as the availability of multimodal dataset, specifically built for prediction purposes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Action Anticipation from Multimodal Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2019
Citations: 3	License type: cc-by-nc-nd

Similar Papers

Automatic Non-Invasive Cough Detection based on Accelerometer and Audio Signals.
Madhurananda Pahar ... Igor Miranda
Journal of Signal Processing Systems | VOL. 94
Madhurananda Pahar, et. al.Madhurananda Pahar ... Igor Miranda
19 Mar 2022
Journal of Signal Processing Systems | VOL. 94

Boosting and Decreasing Action Prediction Abilities Through Excitatory and Inhibitory tDCS of Inferior Frontal Cortex.
Alessio Avenanti ... Riccardo Paracampo
Cerebral Cortex | VOL. 28
Alessio Avenanti, et. al.Alessio Avenanti ... Riccardo Paracampo
14 Mar 2017
Cerebral Cortex | VOL. 28

Hybrid system for automatic detection of gunshots in indoor environment
Sami Ur Rahman ... Adnan Khan
Multimedia Tools and Applications | VOL. 80
Sami Ur Rahman, et. al.Sami Ur Rahman ... Adnan Khan
26 Sep 2020
Multimedia Tools and Applications | VOL. 80

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals
Katsuyuki Nakamura ... Li Fei-Fei
-
Katsuyuki Nakamura, et. al.Katsuyuki Nakamura ... Li Fei-Fei
01 Jul 2017
01 Jul 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Action Anticipation from Multimodal Data

Abstract

Talk to us

Similar Papers