Untethered gesture acquisition and recognition for virtual world manipulation

David Demirdjian, Teresa Ko, Trevor Darrell

doi:10.1007/s10055-005-0155-3


Open Access



Abstract

Humans use a combination of gesture and speech to interact with objects, and usually do so more naturally without holding a device or pointer. We present a system that incorporates user body-pose estimation, gesture recognition and speech recognition for interaction in virtual reality environments. We describe a vision-based method for tracking the pose of a user in real time and introduce a technique that provides parameterized gesture recognition. More precisely, we train a support vector classifier to model the boundary of the space of possible gestures, and train hidden Markov models (HMMs) on specific gestures. Given a sequence, we can find the start and end of various gestures using the support vector classifier, and find gesture likelihoods and parameters with an HMM. A multimodal recognition process is performed using rank-order fusion to merge speech and vision hypotheses. Finally, we describe the use of our multimodal framework in a virtual world application that allows users to interact using gestures and speech.
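The abstract does not spell out how rank-order fusion is computed, so the following is only a minimal sketch of one common variant, not the authors' implementation. It assumes each recognizer returns a list of hypothesis labels ordered best first, sums the rank each hypothesis receives from the two lists, and gives a hypothesis missing from a list a rank one worse than that list's last entry. The function name `rank_order_fusion` and the example labels are hypothetical.

```python
from typing import List


def rank_order_fusion(speech: List[str], vision: List[str]) -> List[str]:
    """Merge two best-first hypothesis lists by summed rank (lower is better).

    Assumption: this is a simple rank-sum scheme chosen for illustration;
    the paper's actual fusion rule may differ.
    """
    candidates = set(speech) | set(vision)

    def rank(hypotheses: List[str], label: str) -> int:
        # A label absent from a list is ranked just below that list's last entry.
        return hypotheses.index(label) if label in hypotheses else len(hypotheses)

    scores = {h: rank(speech, h) + rank(vision, h) for h in candidates}
    # Break ties alphabetically so the output is deterministic.
    return sorted(candidates, key=lambda h: (scores[h], h))


# Hypothetical n-best lists from the speech and vision recognizers.
speech_hypotheses = ["move", "remove", "select"]
vision_hypotheses = ["select", "move", "point"]
fused = rank_order_fusion(speech_hypotheses, vision_hypotheses)
```

Under these assumptions "move" is ranked first, because it sits near the top of both lists, while "point" and "remove" each appear in only one list and fall to the bottom. Working with ranks rather than raw scores avoids having to calibrate speech and vision likelihoods against each other.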

Full Text