Learning When Agents Can Talk to Drivers Using the INAGT Dataset and Multisensor Fusion

Tong Wu,Nikolas Martelaro,Simon Stent,Jorge Ortiz,Wendy Ju

doi:10.1145/3478125

Abstract

This paper examines sensor fusion techniques for modeling opportunities for proactive speech-based in-car interfaces. We leverage the Is Now a Good Time (INAGT) dataset, which consists of automotive, physiological, and visual data collected from drivers who self-annotated responses to the question "Is now a good time?," indicating the opportunity to receive non-driving information during a 50-minute drive. We augment this original driver-annotated data with third-party annotations of perceived safety, in order to explore potential driver overconfidence. We show that fusing automotive, physiological, and visual data allows us to predict driver labels of availability, achieving an 0.874 F1-score by extracting statistically relevant features and training with our proposed deep neural network, PazNet. Using the same data and network, we achieve an 0.891 F1-score for predicting third-party labeled safe moments. We train these models to avoid false positives---determinations that it is a good time to interrupt when it is not---since false positives may cause driver distraction or service deactivation by the driver. Our analyses show that conservative models still leave many moments for interaction and show that most inopportune moments are short. This work lays a foundation for using sensor fusion models to predict when proactive speech systems should engage with drivers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning When Agents Can Talk to Drivers Using the INAGT Dataset and Multisensor Fusion

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies

Lead the way for us

Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies	Publication Date: Sep 9, 2021
Citations: 15

Similar Papers

Information Integrity for Multi-sensors Data Fusion in Smart Mobility
Doaa Mohey El-Din ... Ehab E Hassanien
-
Doaa Mohey El-Din, et. al.Doaa Mohey El-Din ... Ehab E Hassanien
24 Jul 2019
24 Jul 2019

Preface to the special section on data fusion: Architectures and issues
M Kokar ... K Kim
Control Engineering Practice | VOL. 2
M Kokar, et. al.M Kokar ... K Kim
01 Oct 1994
Control Engineering Practice | VOL. 2

Classification of Driver Distraction: A Comprehensive Analysis of Feature Generation, Machine Learning, and Input Measures.
Anthony D Mcdonald ... Tyler A Wiener
Human Factors: The Journal of the Human Factors and Ergonomics Society | VOL. 62
Anthony D Mcdonald, et. al.Anthony D Mcdonald ... Tyler A Wiener
25 Jun 2019
Human Factors: The Journal of the Human Factors and Ergonomics Society | VOL. 62

Bidirectional visual-tactile cross-modal generation using latent feature space flow model
Yu Fang ... Jie Zhao
Neural Networks | VOL. 172
Yu Fang, et. al.Yu Fang ... Jie Zhao
27 Dec 2023
Neural Networks | VOL. 172

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning When Agents Can Talk to Drivers Using the INAGT Dataset and Multisensor Fusion

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies