Enhancing smart home interaction through multimodal command disambiguation

Tommaso Calò,Luigi De Russis

doi:10.1007/s00779-024-01827-3

Abstract

AbstractSmart speakers are entering our homes and enriching the connected ecosystem already present in them. Home inhabitants can use those to execute relatively simple commands, e.g., turning a lamp on. Their capabilities to interpret more complex and ambiguous commands (e.g., make this room warmer) are limited, if not absent. Large language models (LLMs) can offer creative and viable solutions to enable a practical and user-acceptable interpretation of such ambiguous commands. This paper introduces an interactive disambiguation approach that integrates visual and textual cues with natural language commands. After contextualizing the approach with a use case, we test it in an experiment where users are prompted to select the appropriate cue (an image or a textual description) to clarify ambiguous commands, thereby refining the accuracy of the system’s interpretations. Outcomes from the study indicate that the disambiguation system produces responses well-aligned with user intentions, and that participants found the textual descriptions slightly more effective. Finally, interviews reveal heightened satisfaction with the smart-home system when engaging with the proposed disambiguation approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing smart home interaction through multimodal command disambiguation

Abstract

Talk to us

Similar Papers

More From: Personal and Ubiquitous Computing

Lead the way for us

Journal: Personal and Ubiquitous Computing	Publication Date: Jul 22, 2024
License type: CC BY 4.0

Similar Papers

An Investigation on Utilizing Large Language Model for Industrial Computer-Aided Design Automation
Haoxuan Deng ... John Ahmet Erkoyuncu
Procedia CIRP | VOL. 128
Haoxuan Deng, et. al.Haoxuan Deng ... John Ahmet Erkoyuncu
01 Jan 2024
Procedia CIRP | VOL. 128

Grounding Verbs of Motion in Natural Language Commands to Robots
Thomas Kollar ... Deb Roy
-
Thomas Kollar, et. al.Thomas Kollar ... Deb Roy
01 Jan 2014
01 Jan 2014

Open Source Platform Digital Personal Assistant
Azat Khusnutdinov ... Denis Usachev
-
Azat Khusnutdinov, et. al.Azat Khusnutdinov ... Denis Usachev
01 May 2018
01 May 2018

Multi-Label Classification of Daily Drill Reports (DDR) Utilizing Large Language Models (LLMs)
Wajih Asif ... Nouf Al Noufli
-
Wajih Asif, et. al.Wajih Asif ... Nouf Al Noufli
04 Nov 2024
04 Nov 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing smart home interaction through multimodal command disambiguation

Abstract

Talk to us

Similar Papers

More From: Personal and Ubiquitous Computing