G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Zeyu Wang,Kun Yan,Yuhan Wang,Yuanchun Shi,Lei Ji,Yuntao Wang,Xuhai Xu,Yuchen Yao,Chun Yu

doi:10.1145/3659623

Abstract

Modern information querying systems are progressively incorporating multimodal inputs like vision and audio. However, the integration of gaze --- a modality deeply linked to user intent and increasingly accessible via gaze-tracking wearables --- remains underexplored. This paper introduces a novel gaze-facilitated information querying paradigm, named G-VOILA, which synergizes users' gaze, visual field, and voice-based natural language queries to facilitate a more intuitive querying process. In a user-enactment study involving 21 participants in 3 daily scenarios (p = 21, scene = 3), we revealed the ambiguity in users' query language and a gaze-voice coordination pattern in users' natural query behaviors with G-VOILA. Based on the quantitative and qualitative findings, we developed a design framework for the G-VOILA paradigm, which effectively integrates the gaze data with the in-situ querying context. Then we implemented a G-VOILA proof-of-concept using cutting-edge deep learning techniques. A follow-up user study (p = 16, scene = 2) demonstrates its effectiveness by achieving both higher objective score and subjective score, compared to a baseline without gaze data. We further conducted interviews and provided insights for future gaze-facilitated information querying systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies

Lead the way for us

Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies	Publication Date: May 13, 2024
License type: cc-by-nc

Similar Papers

Editor's evaluation: Retinal motion statistics during natural locomotion
Krystel R Huxlin
-
Krystel R HuxlinKrystel R Huxlin
18 Oct 2022
18 Oct 2022

Author response: Spherical arena reveals optokinetic response tuning to stimulus location, size, and frequency across entire visual field of larval zebrafish
Florian A Dehmelt ... Tom Baden
-
Florian A Dehmelt, et. al.Florian A Dehmelt ... Tom Baden
08 Apr 2021
08 Apr 2021

Analysing the deep structure of queries: Transfer effect on learning a query language
Lena Linde ... Monica Bergström
Acta Psychologica | VOL. 78
Lena Linde, et. al.Lena Linde ... Monica Bergström
01 Dec 1991
Acta Psychologica | VOL. 78

Query languages for the casual user
William C Ogden ... Susan R Brooks
-
William C Ogden, et. al.William C Ogden ... Susan R Brooks
01 Jan 1982
01 Jan 1982

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies