Multimodal Egocentric Analysis of Focused Interactions

Sophia Bano,Tamas Suveges,Jianguo Zhang,Stephen J Mckenna

doi:10.1109/access.2018.2850284

Sophia Bano, Tamas Suveges + Show 2 more

Open Access

https://doi.org/10.1109/access.2018.2850284

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2018
Citations: 53	License type: CC BY 3.0

Affiliation: University of Dundee

Abstract

Continuous detection of social interactions from wearable sensor data streams has a range of potential applications in domains, including health and social care, security, and assistive technology. We contribute an annotated, multimodal data set capturing such interactions using video, audio, GPS, and inertial sensing. We present methods for automatic detection and temporal segmentation of focused interactions using support vector machines and recurrent neural networks with features extracted from both audio and video streams. The focused interaction occurs when the co-present individuals, having the mutual focus of attention, interact by first establishing the face-to-face engagement and direct conversation. We describe an evaluation protocol, including framewise, extended framewise, and event-based measures, and provide empirical evidence that the fusion of visual face track scores with audio voice activity scores provides an effective combination. The methods, contributed data set, and protocol together provide a benchmark for the future research on this problem. The data set is available at https://doi.org/10.15132/10000134 .

Highlights

We consider automatic detection of social interactions by analysis of wearable sensor data
We report results for detecting focused interactions using more data, temporal filtering, and Long Short-Term Memory (LSTM) recurrent neural networks as well as Support Vector Machines (SVMs) using audio-only, video-only, and audio-visual features
By analysing the performance of both SVM and LSTM-Recurrent Neural Networks (RNNs) with audio, visual, and audio-visual features, we aim to obtain a deeper understanding of our application and dataset, and to provide more comprehensive benchmarking for future research

Summary

Introduction

We consider automatic detection of social interactions by analysis of wearable sensor data. Focused interaction occurs when two or more co-present individuals, having mutual focus of attention, interact by establishing face-to-face engagement and direct conversation [1]. Face-to-face engagement is often not maintained throughout the entirety of a focused interaction; for example a group of people talking while in conversation will typically look at each other only intermittently. This concept of focused interaction is more specific than that of social interaction which can be considered to occur whenever individuals communicate and interact with one another whether or not they are physically co-present, e.g. by telephone [2]. Individuals in an unfocused interaction are aware of each others’ presence but establish only indirect engagement which might involve brief eye contact, or facial expressions for example

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal Egocentric Analysis of Focused Interactions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Using support vector machines for time series prediction
U Thissen ... L.M.C Buydens
Chemometrics and Intelligent Laboratory Systems | VOL. 69
U Thissen, et. al.U Thissen ... L.M.C Buydens
15 Aug 2003
Chemometrics and Intelligent Laboratory Systems | VOL. 69

<title>Synchronization-sensitive motion and interpolation-based frame estimation in video lossy transmission</title>
Sherif G Aly ... Abdou Youssef
-
Sherif G Aly, et. al.Sherif G Aly ... Abdou Youssef
22 Mar 2001
22 Mar 2001

Finding Time Together: Detection and Classification of Focused Interaction in Egocentric Video
Sophia Bano ... Stephen J Mckenna
-
Sophia Bano, et. al.Sophia Bano ... Stephen J Mckenna
01 Oct 2017
01 Oct 2017

A novel framework for wind speed prediction based on recurrent neural networks and support vector machine
Chuanjin Yu ... Guanghao Zhai
Energy Conversion and Management | VOL. 178
Chuanjin Yu, et. al.Chuanjin Yu ... Guanghao Zhai
16 Oct 2018
Energy Conversion and Management | VOL. 178

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal Egocentric Analysis of Focused Interactions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access